i 



(19) 



3 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



iiniiuigiiii 



(12) 



(n) EP 1 334 979 A1 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

13.08.2003 Bulletin 2003/33 

(21) Application number: 02075565.8 

(22) Date of filing. 08.02.2002 



(51) lntCl7: C07K 14/415, C12N 15/82, 
C07K 16/16, G01N 33/50, 
C12N 5/10, A01H 1/04 



(84) Designated Contracting States: 

AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 
MC NL PT SE TR 

Designated Extension States: 
AL LT LV MK RO Si 

(71) Applicant: Kweek-en Research bed r I jf A grlco B.V. 
8314 PP Bant(NL) 

(72) Inventors: 

• van der Vossen, Edwin Andries Gerard 
3572 ZM Utrecht (NL) 

• Allefs, Josephus Jacobus Hendricus Maria 
8304 EJ Emmeloord (NL) 



(74) Representative: Prins, Adrianus Willem et al 
Vereenigde, 
Nieuwe Parklaan 97 
2587 BN Den Haag (NL) 

Remarks: 



The sequence fisting, which is published as annex 
to the application documents, was filed after the date 
of filing. The applicant has declared that it does not 
include matter which goes beyond the content of the 
application as filed. 



(54) Gene conferring resistance to Phytophthera infestans (late-blight) in Solanaceae 



(57) The invention relates to the field of plant dis- 
eases, in particular to oomycete infections such as late 
blight, a disease of major importance to production of 
Solanacae such as potato and tomato cultivars. The in- 
vention provides a method for providing a plant or its 
progeny with resistance against an oomycete infection 



comprising providing said plant or part thereof with a 
gene or functional fragment thereof comprising a nucleic 
acid, said nucleic acid encoding a gene product that is 
capable of providing a member of the Solanacae with 
resistance against an oomycete fungus. 



CT> 

*r 

CO 
CO 



LU 



Printed by Jouve. 75001 PARIS (FR) 



BEST AVAILABLE COPY 



i 



EP 1 334 979 A1 



Description 



[0001] The invention relates to the field of plant diseases. 

[0002] Late blight, caused by the oomycete pathogen Phytophthora infestans is world-wide the most destructive 
5 disease for potato cultivation. The disease also threatens the tomato crop. The urgency of obtaining resistant cultivars 
has intensified as more virulent, crop-specialised and pesticide resistant strains of the pathogen are rapidly emerging. 
[0003] A way to prevent crop failures or reduced yields is the application of fungicides that prevent or cure an infection 
by P infestans. However, the application of crop protectants is widely considered to be a burden for the environment. 
Thus, in several Western countries, legislation is becoming more restrictive and partly prohibitive to the application of 
io specific fungicides, making chemical control of the disease more difficult. An alternative approach is the use of cultivars 
that harbour partial or complete resistance to late blight. Two types of resistance to late blight have been described 
and used in potato breeding. One kind is conferred by a series of major, dominant genes that render the host incom- 
patible with specific races of the pathogen (race specific resistance). Eleven such R genes (R1 -R1 1) have been iden- 
tified and are believed to have originated in the wild potato species Soianum demissum, which is native to Mexico, 
is where the greatest genetic variation of the pathogen is found. Several of these R genes have been mapped on the 
genetic map of potato. R1 and R2 are located on chromosomes 5 and 4, respectively. R3, R6and R7are located on 
chromosome 11 . Unknown R genes conferring race specific resistance to late blight have also been described in S. 
tuberosum ssp. andigena and S. berthaultii. Because of the high level of resistance and ease of transfer, many cultivars 
contain S. demissum derived resistance. Unfortunately, S. demissum derived race specific resistance, although nearly 
20 complete, is not durable. Once newly bred cultivars are grown on larger scale in commercial fields, new virulences 
emerge in P infestans that render the pathogen able to overcome the introgressed resistance. The second type of 
resistance, often quantitative in nature, is race non-specific and is thought to be more durable. Race non-specific 
resistances late blight can be found in several Mexican and Middle and South American Soianum species. 
[0004] Diploid S. buibocastanum from Mexico and Guatemala is one of the tuber bearing species that is known for 
25 its race non-specific resistance to late blight. Despite differences in endosperm balance numbers, introgression of the 
S. buibocastanum resistance trait has been successful. Ploidy manipulations and a series of tedious bridge crosses 
has resulted in S. buibocastanum derived, P. infestans resistant germplasm. However, almost 40 years after the first 
crosses and intense and continuous breeding efforts by potato breeders in the Netherlands with this germplasm, late 
blight resistant cultivars still remain to be introduced on the market. Successful production of somatic hybrids of S. 
so buibocastanum and S. tuberosum has also been reported. Some of these hybrids and backcrossed germplasm were 
found to be highly resistant to late blight, even under extreme disease pressure. Despite reports of suppression of 
recombination, resistance in the backcrossed material appeared to be on chromosome 8 within an approximately 6 
cM interval between the RFLP markers CP53 and CT64. A CAPS marker derived from the tomato RFLP probe CT88 
cosegregated with resistance. Suppression of recombination between the S. buibocastanum and S. tuberosum chro- 
35 mosomes forms a potential obstacle for successful reconstitution of the recurrent cultivated potato germplasm to a 
level that could meet the standards for newly bred potato cultivars. Isolation of the genes that code for resistance found 
in S. buibocastanum and subsequent transformation of existing cultivars with these genes, would be a much more 
straight forward and quicker approach when compared to introgression breeding. 

[0005] The cloning and molecular characterisation of numerous plant R genes conferring disease resistance to bac- 
40 teria, fungi, viruses, nematodes, and insects has identified several structural features characteristic to plant R genes. 
The majority are members of tightly linked multigene families and all R genes characterised so far, with the exception 
of Pto, encode leucine-rich repeats (LRRs), structures shown to be involved in protein-protein interactions. LRR con- 
taining R genes can be divided into two classes based on the presence of a putative tripartite nucleotide-binding site 
(NBS). R genes of the NBS-LRR class comprise motifs that are shared with animal apoptosis regulatory proteins. The 
45 second class of LRR containing R genes encompasses genes with a predicted hydrophobic membrane-anchoring 
domain with a predicted extracellular N-terminal LRR motif. The recently cloned resistance gene R1 conferring race 
specific resistance to late blight belongs to the NBS-LRR class of R genes. 

[0006] The invention provides an isolated or recombinant nucleic acid essentially corresponding to a cluster of genes 
identifiable by phylogenetic tree analyses, preferably of the encoded amino acid sequence, for example when com- 
50 paring functionalities, as corresponding to the Rpi-blb, RGC1-blb, RGC3-blb and RGC4-blb gene cluster (herein also 
called the Rpi-blb gene cluster) of figure 9. 

[0007] Phylogenetic tree analysis is carried out as follows. First a multiple sequence alignment is made of the nucletc 
acid sequences an/or preferably of the deduced amino acid sequences of the genes to be analysed using CLUSTALW 
(http://www2.ebi.ac.uk/clustalw), which is in standard use in the art. CiustalW produces a dnd file, which can be read 
ss by TREEVIEW (http:/Aaxonomy.20ology.gla.ac.uk/rod/rod.html). The phylogenetic tree depicted in Figure 9 is a phy- 
logram. 

[0008] Phylogenetic studies of the deduced amino acid sequences of Rpi-blb, RGC1-blb, RGC3-bIb, RGC4-blb and 
those of the most similar genes from the art (as defined by the BLASTX) derived from diverse species, using the 
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Neighbour-Joining method of Saitou and Nei (1987), shows that corresponding genes or functional fragments thereof 
of the Rpi-blb gene cluster can be placed in a separate branch (Figure 9). 

[0009] Seq uence comparisons between the four members of the Rpi-blb gene cluster identified on 8005-8 B AC clone 
SPB4 show that sequence homology within the Rpi-blb gene cluster varies between 70% and 81% at the amino acid 

5 sequence level, providing a convenient rule of thumb: a nucleic acid encoding a peptide of at least 15 amino acids, 
preferably of at least 25 amino acids, most preferably of at least 50 amino acids, having at least 70% homology to 
corresponding stretches of peptides selected from any of the proteins encoded by the Rpi-blb, RGCI-blb, RGC3-blb 
and RGC4-blb genes are provided as functional fragment, herewith. The deduced amino acid sequence of Rpi-blb 
shares the highest overall homology with RGC3-blb (81% amino-acid sequence identity; Table 4). When the different 

10 domains are compared it is dear that the effector domains present in the N-terminal halves of the proteins (coiled-coil 
and NBS-ARC domains) share a higher degree of homology (91% sequence identity) than the C-terminal halves of 
these proteins which are thought to contain the recognition domains (LRRs; 71% amino acid sequence identity). Com- 
parison of all four amino-acid sequences revealed a total of 104 Rpi-blb specific amino acid residues (Figure 10). The 
majority of these are located in the LRR region (80/104). Within the latter region, these specific residues are concen- 

15 trated in the LRR subdomain xxLxLxxxx. The relative frequency of these specific amino-acid residues within this LRR 
subdomain is more than two times higher (28.3%) than that observed in the rest of the LRR domain (12.3%). The 
residues positioned around the two conserved leucine residues in the consensus xxLxxLxxxx are thought to be solvent 
exposed and are therefore likely to be involved in creating/maintaining recognition specificity of the resistance protein. 
[0010] Sequences of additional members of the Rpi-blb gene cluster can be obtained by screening genomic DNA 

20 or insert libraries, e.g. BAC libraries with primers based on signature sequences of the Rpi-blb gene. Screening of 
various Solanum BAC libraries with primer sets A and/or B (Table 2 and Figure 7) identified numerous Rpi-blb homo- 
logues derived from different Solanum species. Alignment of these additional sequences with those presented in Figure 
1 0 will help identify additional members of the Rpi-blb gene cluster and specific amino acid residues therein responsible 
for R infestans resistance specificity. Furthermore, testing additional sequences in the above described phylogenetic 

25 tree analyses, e.g. using the Neighbour-Joining method of Saitou and Nei (1987), provides additional identification of 
genes belonging to the Rpi-blb gene cluster. 

[0011] The invention provides the development of an intraspecific mapping population of S. bufbocastanum that 
segregated for race non-specific resistance to late blight. The resistance was mapped on chromosome 8, in a region 
located 0.3 cM distal from CT88. Due to the race non-specific nature of the resistance, S. bufbocastanum late blight 
so resistance has always been thought to be R gene independent. However, with the current invention we demonstrate 
for the first time that S. bufbocastanum race non-specific resistance is in fact conferred by a gene bearing similarity to 
an R gene of the NBS-LRR type. 

[0012] The invention further provides the molecular analysis of this genomic region and the isolation by map:based 
cloning of a DNA-fragment of the resistant parent that harbours an R gene, designated Rpi-bfb. This DNA-fragment 

35 was subcloned from an approximately 80 kb bacterial artificial chromosome (BAC) clone which contained fourcomplete 
R gene-like sequences in a cluster-like arrangement. Transformation of a susceptible potato cultivarby Agrobacterium 
tumefaciens revealed that one of the four R gene- like sequences corresponds to Rpi-bfb that provides the race non- 
specific resistance to late blight. Characterisation of the Rpi-bfb gene showed that it is a member of the NBS-LRR class 
of plant R genes. The closest functionally characterised sequences of the prior art are members of the 12 resistance 

40 gene family in tomato. These sequences have an overall amino acid sequence identity of approximately 32% with that 
of Rpi-blb. 

[0013] Thus, in a first embodiment, the invention provides an isolated or recombinant nucleic acid, said nucleic acid 
encoding a gene product that is capable of providing a member of the Solanacae family with race non-specific resistance 
against an oomycete pathogen. 
45 [0014] Isolation of the gene as provided here that codes for the desired resistance trait against late blight and sub- 
sequent transformation of existing potato and tomato cultivars with this gene now provides a much more straightforward 
and quicker approach when compared to introgression breeding. The results provided here offer possibilities to further 
study the molecular basis of the plant pathogen interaction, the ecological role of R genes in a wild Mexican potato 
species and are useful for development of resistant potato or tomato cultivars by means of genetic modification. 
[0015] In contrast to the R genes cloned and described so far, the gene we provide here is the first isolated R gene 
from a Solanum species that provides race non-specific resistance against an oomycete pathogen. Notably, the inven- 
tion provides here a nucleic acid wherein said Solanum species that is provided with the desired resistance comprises 
S. tuberosum. In particular, it is the first gene that has been isolated from a phylogenetically distinct relative of cultivated 
potato, S. bulbocastanum, for which it was shown by complementation assays, that it is functional in S. tuberosum. 
55 These data imply that the gene Rpi-blb can easily be applied in potato production without a need for time-consuming 
and complex introgression breeding. 

[0016] The following definitions are provided for terms used in the description and examples that follow. 
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Nucleic acid: a double or single stranded DNA or RNA molecule. 
Oligonucleotide: a short single-stranded nucleic acid molecule. 

Primer the term primer refers to an oligonucleotide that can prime the synthesis of nucleic acid. 
Homology: homology may be defined and determined by the TBLASTN or TBLASTP program for nucleic acid or 
amino acid sequences, respectively, of Altschul era/. (1990), which is in standard use in the art or. and I this i may 
be preferred, the standard program BestFit, which is part of the Wisconsin Package, Version 8, September 1994. 
(Genetics Computer Group. 575 Science Drive. Madison, Wisconsin, USA, Wisconsin 53711). Homology may be 
at the nucleotide sequence and/or encoded amino acid sequence level. Preferably the nucleic acic land/or ammo 
acid sequence shares at least 50%. or 60% homology, most preferably at least about 70%, or 80% or 90% homology 
with the sequence as depicted in Fig. 6. As shown in Table 4. the closest functionally characterised sequence of 
the prior art (members of the 12 Fusarium resistance gene cluster in tomato) has a much lower level of amino acid 
sequence identity than this (32% with respect to that of Rpi-blb). Homology within the R gene cluster of the present 
invention varies between 70% and 81% at the amino acid sequence level. Alternatively, a sequence ,s defined as 
belonging to the same cluster when numerous sequences are compared according to the Neighbour-Joining meth- 
od of Saitou and Nei (1987). , L „„ , r 
Promoter: the term "promoter" is intended to mean a short DNA sequence to wh.ch RNA polymerase and/or other 
transcription initiation factors bind prior to transcription of the DNA to which the promoter is functionally connected, 
allowing transcription to take place. The promoter is usually situated upstream (5 1 ) of the coding sequence. In .ts 
broader scope, the term "promoter includes the RNA polymerase binding site as well as regulatory sequence 
elements located within several hundreds of base pairs, occasionally even further away, from the transcnp ion 
start site Such regulatory sequences are, e.g., sequences that are involved in the binding of protein factors that 
control the effectiveness of transcription initiation in response to physiological conditions. The promoter region 
should be functional in the host cell and preferably corresponds to the natural promoter region of the Rpi-blb 
resistance gene. However, any heterologous promoter region can be used as long as it is functional in the host 
cell where expression is desired. The heterologous promoter can be either constitutive or regulable. Aconstrtutrve 
promoter such as the CaMV 35S promoter or T-DNA promoters, all well known to those skilled in the art, .s a 
promoter which is subjected to substantially no regulation such as induction or repression, but which allows for a 
steady and substantially unchanged transcription of the DNA sequence to which it is functionally bound ,n al active 
cells of the organism provided that other requirements for the transcription to take place is fulfilled. A regulable 
promoter is a promoter of which the function is regulated by one or more factors. These factors may either be such 
which by their presence ensure expression of the relevant DNA sequence or may, alternatively, be such which 
suppress the expression of the DNA sequence so that their absence causes the DNA sequence to be expressed. 
Thus the promoter and optionally its associated regulatory sequence may be activated by the presence or absence 
of one or more factors to affect transcription of the DNA sequences of the genetic construct of the invention. 
Suitable promoter sequences and means for obtaining an increased transcription and expression are known to 

SnatoMhe inscription terminator serves to terminate the transcription of the DNA into RNA and is preferably 
selected from the group consisting of plant transcription terminator sequences, bacterial transcnption terminator 
sequences and plant virus terminator sequences known to those skilled in the art. 

Gene- the term "gene" is used to indicate a DNA sequence which is involved in producing a polypeptide chain and 
which includes regions preceding and following the coding region (5'-upstream and 3'-downstream sequences) as 
well as intervening sequences, the so-called introns, which are placed between individual coding segments (so- 
called exons) or in the 5'-upstream or 3'-downstream region. The 5'-upstream region may comprise a regulatory 
sequence that controls the expression of the gene, typically a promoter. The 3'-downstream region may comprise 
sequences which are involved in termination of transcription of the gene and optionally sequences responsible for 
polyadenylation of the transcript and the 3" untranslated region. The term "resistance gene" is an isolated nucleic 
acid according to the invention said nucleic acid encoding a gene product that is capable of providing a member 
of the Solanacae family with resistance against an oomycete pathogen, said nucleic acid preferably comprising a 
sequence as depicted in Fig. 6 or part thereof, or a homologous sequence with essentially similar functional and 
structural characteristics. A functionally equivalent fragment of such a resistance gene or nucleic acid as provided 
by the invention encodes a fragment of a polypeptide having an amino acid sequence as depicted in Fig. 8 or part 
thereof or a homologous and/or functionally equivalent polypeptide, said fragment exhibiting the charactenstic of 
providing at least partial resistance to an oomycete infection such as caused by P. infestans when incorporated 
and expressed in a plant or plant cell. 
. Resistance gene product: a polypeptide having an amino acid sequence as depicted in Fig. 8 or part thereof, or 
a homologous and/or functionally equivalent polypeptide exhibiting the characteristic of providing at least partia 
resistance to an oomycete infection such as caused by P. infestans when incorporated and expressed in a plant 
or plant cell. 
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Ro plant: primary regenerant from a transformation experiment, also denoted as transformed plant or transgenic 
plant. 

[001 7] In the present invention we have identified and isolated the resistance gene Rpi-blb, which confers race non- 
specific resistance to Phytophthora infestans. The gene was cloned from a Solanum bu/bocastanum genotype that is 
resistant to R infestans. The isolated resistance gene according to the invention can be transferred to a susceptible 
host plant using Agrobacterium mediated transformation or any other known transformation method, and is involved 
in conferring the host plant resistant to plant pathogens, especially R infestans. The host plant can be potato, tomato 
or any other plant, in particular a member of the Sofanacae family that may be infected by such a plant pathogen. The 
present invention provides also a nucleic acid sequence comprising the Rpi-blb gene, or a functionally equivalent 
fragment thereof, which is depicted in Figure 6. 

[0018] With the Rp'hbib resistance gene or functionally equivalent fragment thereof according to the invention, one 
has an effective means of control against plant pathogens, since the gene can be used for transforming susceptible 
plant genotypes thereby producing genetically transformed plants having a reduced susceptibility or being preferably 
resistant to a plant pathogen. In particular, a plant genetically transformed with the Rpibfb resistance gene according 
to the invention has a reduced susceptibility to R infestans. 

[0019] In a preferred embodiment the Rpi-blb resistance gene comprises the coding sequence provided in Figure 
6B or any corresponding or homologous sequence preceded by a promoter region and/or followed by a terminator 
region. The promoter region should be functional in plant cells, and preferably correspond to the native promoter region 
of the Rpi-bibgene. However, a heterologous promoter region that is functional in plant cells can be used in conjunction 
with the coding sequences. 

[0020] In addition the invention relates to the Rpi-bib resistance gene product which is encoded by the Rpi-blb gene 
according to the invention and which has an amino acid sequence provided in Figure 8, or which is homologous to the 
deduced amino acid sequence or part thereof. 

[0021] The invention also provides a vector comprising a nucleic acid as provided herein, said nucleic acid encoding 
a gene product that is capable of providing a member of the Solanacae family with resistance against an oomycete 
pathogen, or a functionally equivalent isolated or recombinant nucleic acid in particular wherein said member comprises 
S. tuberosum or Lycopersicon esculentum. 

[0022] The invention also provides a host cell comprising a nucleic acid or a vector according to the invention. An 
example of said host cell is provided in the detailed description herein. In a particular embodiment, said host cell 
comprises a plant cell. As a plant cell a cell derived from a member of the Solanacae family is preferred, in particular 
wherein said member comprises S. tuberosum or Lycopersicon esculentum. From such a ceil, or protoplast, a trans- 
genic plant, such as transgenic potato plant or tomato plant with resistance against an oomycete infection can arise. 
The invention thus also provides a plant, or tuber root, fruit or seed or part or progeny derived thereof comprising a 
cell according to the invention. 

[0023] Furthermore, the invention provides a proteinaceous substance, exhibiting the characteristic of providing at 
least partial resistance to an oomycete infection such as caused by R infestans when incorporated and expressed in 
a plant or plant cell. In particular such a proteinaceous substance is provided that is encoded by a nucleic acid according 
to the invention. In a preferred embodiment, the invention provides a proteinaceous substance comprising an amino 
acid sequence as depicted in figure 8 or part thereof. Such a proteinaceous substance is for example useful for obtaining 
a binding molecule directed at said substance. Particular easy to obtain, merely by immunizing an appropriate animal 
and harvesting a polyclonal serum or a monoclonal antibody, are antibodies or fragments thereof, but other binding 
molecules such as synthetic antibodies or peptide mimics thereof can for example be obtained by phage display meth- 
ods. 

[0024] Furthermore, the invention provides a binding molecule directed at a nucleic acid according to the invention. 
For example, the Rpi-blb gene can be used for the design of oligonucleotides complementary to one strand of the DNA 
sequence as depicted in Figure 7 and Table 2. Such oligonucleotides as provided herein are useful as probes for library 
screening, hybridisation probes for Southern/Northern analysis, primers for PCR, for use in a diagnostic kit for the 
detection of disease resistance and so on. Such oligonucleotides are useful fragments of an isolated or recombinant 
nucleic acid as provided herein, said nucleic acid encoding a gene product that is capable of providing a member of 
the Solanacae family with resistance against an oomycete fungus, or a functionally equivalent isolated or recombinant 
nucleic acid, in particular wherein said member comprises S. tuberosum or Lycopersicon esculentum. They can be 
easily selected from a sequence as depicted in figure 6 or part thereof. A particular point of recognition comprises the 
LRR domain as identified herein. Such a binding molecule according to the invention is used as a probe or primer, for 
example provided with a label, in particular wherein said label comprises an excitable moiety which makes it useful to 
detect the presence of said binding molecule. 

[0025] The invention furthermore provides a method for selecting a plant or plant material or progeny thereof for its 
susceptibility or resistance to an oomycete infection comprising testing at least part of said plant or plant material or 
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of providing a member of the folanacae 'amitywrt "J*™* a 9 y , am or |ant material or progeny 

oomyceteinfec.ionjnparticularwhereinsa.doomyceecompr^es^ P efe 

S. tuberosum, said method for providing > • jJ^S^l JSSCS^ "™P risi "9 a ™ C,eiC 
infection comprising providing sa,d plant or pa^hereo 'J^^ a member of the Solanacae fami.y with 

Z^^^^^^ - S. b^casfanum, or part thereof, such as a tuber or see., 

susceptible to an oomycete infection caused by P. infestans. 

[0028] The invention is further described in the detailed descnpt.cn below. 

DESCRIPTION OF THE FIGURES 

[0029] 

accessions. 

SSK35SSSS3SSSSSS 

genres) THigh density genetic linkage map of the Rp/-b/b locus (n=2109 genotypes). 

the horizontal line indicate the number of recomo.nanis loeni Relative positions of candidate 

ess :;:r^=^™^.„ u »c. p — - p— - » 

678-nucleotide long intron sequence, 
blotting are indicated. 

potato cultivar Impala through Agrotoacteriuro mediated transformation. 
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and 743-1365). 
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Figure 7. Relative primer positions. The horizontal bar represents the coding sequence of the Rpi-blb gene. Num- 
bers represent nucleotide positions. Horizontal arrows indicate relative primer positions and orientations. GSP1 
and GSP2 represent nested gene specific primers used for 3' RACE experiments. GSP3 and GSP4 represent 
nested gene specific primers used for 5 l RACE experiments. A(F), A(R), B(F) and B(R) are primers used to amplify 
ftp/'-fc/fchomologues. The position of the restriction site A/s/I used to make domain swaps between Rpi-blb homo- 
logues is indicated. 

Figure 8. Deduced Rpi-blb protein sequence. The amino acid sequence deduced from the DNA sequence of Rpi- 
blb \s divided into three domains (A-C), as described in the text. Hydrophobic residues in domain A that form the 
first and fourth residues of heptad repeats of potential coiled-coil domains are underlined. Conserved motifs in R 
proteins are written in lowercase and in italic in domain B. Residues matching the consensus of the cytoplasmic 
LRR are indicated in bold in domain C. Dots in the sequence have been introduced to align the sequence to the 
*5 consensus LRR sequence of cytoplasmic LRRs. 

Figure 9. Phylogenetic tree of state of the art sequences which share some homology to the deduced amino acid 
sequence of Rpi-blb and its gene cluster members RGC1-blb, RGC3-blb and RGC4-bfb. The tree was made ac- 
cording to the Neighbour-Joining method of Saitou and Nei (1987). An asterix indicates that the gene has been 
20 assigned a function. The Rpi-btb gene cluster is boxed. 

Figure 10. Alignment of the deduced protein products encoded by Rpi-blb, RGCI-bfb, RGC3-bfband RGC4-blb. 
The complete amino acid sequence of Rpi-blb is shown and amino acid residues from RGC1-blb t RGC3-blb and 
RGC4-blb that differ from the corresponding residue in Rpi-blb. Dashes indicate gaps inserted to maintain optimal 
alignment. Amino acid residues that are specific for Rpi-blb, when compared to those at corresponding positions 
in RGCI-blb, RGC3-blb and RGC4-blb, are highlighted in bold. The regions of the LRRs that correspond to the 
consensus L..L..L.L..C/N/S..a..aP are underlined. Conserved motifs in the NBS domain are indicated in lowercase. 
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Figure 11. Schematic overview of domain swaps made between Rpi-blb and homologues RGCI-blb and 
RGC3-blb. The vertical dotted line indicates the position of the Nsh site used to make the swaps. R and S indicate 
whether transgenic plants harbouring specific chimeric constructs are resistant or susceptible to late blight infec- 
tion, respectively. 

Detailed description 

[0030] For the mapping of the Rpi-blb resistance gene an intraspecific mapping population of S. bulbocastanum. was 
developed. A crucial step in this process was the identification of susceptible S. bulbocastanum genotypes. For this 
purpose several S. bulbocastanum accessions originating from different clusters/areas in Mexico were analysed for P 
infestans resistance or susceptibility in a detached leaf assay (Table 1 and Figure 1). The screened accessions BGRC 
8008 and BGRC 7999 contained no susceptible genotypes. However in the accessions BGRC 8005, BGRC 8006 and 
BGRC 7997 r susceptibility was found in 9%, 7% and 14 % of the analysed seedlings, respectively. A R infestans 
susceptible clone of accession BGRC 8006 was subsequently selected and crossed with a resistant clone of accession 
BGRC 8005. The resulting F1 population was used to map the Rpi-blb locus and is hereafter referred to as the B8 
population. 

[0031] Initial screening of 42 BB genotypes for resistance to R infestans in a detached leaf assay suggested that R 
infestans resistance in S. bulbocastanum accession 8005 could be caused by a single dominant R gene, or a tightly 
linked gene cluster. Of the 42 genotypes tested, 22 scored resistant and 16 susceptible in a repeated experiment. 
Resistance phenotypes of the remaining 4 seedlings remained unclear. In order to determine the chromosome position 
of this S. bulbocastanum resistance, B8 genotypes with an undoubted phenotype were used for marker analysis. The 
chromosome 8 specific marker TG330 (Table 2) was found to be linked in repulsion phase with the resistant phenotype, 
as only one recombinant was obtained between this marker and resistance in 12 B8 genotypes. Furthermore, chro- 
mosome 8 marker CT88 (Table 2) was found to be completely linked in repulsion phase to resistance, indicating that 
the locus responsible for resistance, designated Rpi-blb, was located in this region of chromosome 8. For this reason, 
tomato chromosome 8 specific markers that map proximal and distal to CT88 (TG513 and CT64; Tanksley et al., 1992; 
Table 2) were developed into CAPS markers and tested in 512 B8 genotypes with known resistance phenotypes. A 
total of five CT64-CT88 recombinant genotypes and 41 CT88-TG513 recombinant genotypes were identified in this 
screen (Figure 2A). The resistance locus Rpi-blb was mapped 1 recombination event distal to marker CT88 (Figure 2A). 
[0032] Fine mapping of the Rpi-blb\oc\j$ was carried out with CAPS markers derived from left (L) and right (R) border 
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sequences of BAC clones isolated from a BAC library prepared from the resistant S. bulbocastanum genotype BGRC 
8005-8 The BAC library was initially screened with markers CT88 and CT64. BAC clones identified with these markers 
were used as seed BACs for a subsequent chromosome walk to the Rpi-blb locus. A total of 21 09 B8 genotypes were 
screened for recombination between markers TG513 en CT64. All recombinant genotypes (219/2109) were subse- 
quently screened with all available markers in the CT88-CT64 genetic Interval. These data together wrth the disease 
resistance data of each recombinant, obtained through detached leaf assays, positioned the Rpi-blb locus between 
markers SPB33L and B149R, a 0.1 cM genetic interval (4/21 09 recombinants) physically spanned by the overlapping 
BAC clones SPB4 and B49 (Figures 2b and 3). Within this interval resistance cosegregated with the BAC end marker 
SPB42L the sequence of which shared homology to the Fusarium 12 gene cluster from tomato (Ori et al., 1 997, Simons 
et al 1 998) Southern analyses of BAC clones spanning the SP33L-B149R interval using a 32p-labeled PCR fragment 
of marker SPB42L as a probe revealed the presence of at least 4 copies of this Ft gene like sequence wrthm the Rpi- 
bib interval (Figure 4). Moreover, all of these copies were present on BAC SPB4. Sequencing and annotation of the 
complete insert of this BAC clone indeed identified fourcomplete R gene candidates (RGC1-blb, RGC2-blb,RGC3-blb 
and RGC4-blb) of the NBS-LRR class of plant Ft genes. A PCR-marker that was located in-between RGd-blb and 
RGC4-blb revealed recombination between P. infestans resistance and RGC4-olb, ruling out the possibility of RGC4-blb 
beinq Rpi-blb. Despite this finding, all four RGCs were selected for complementation analysis. 
[0033] Genomic fragments of approximately 10 kb harbouring RGd-blb, RGC2-blb, RGC3-blb or RGC4-Wbwere 
subcloned from BAC SPB4 into the binary plant transformation vector pBINPLUS (van Engelen et al., 1 995) and trans- 
ferred to a susceptible potato cultivar using standard transformation methods. Primary transformants were tested for 
P infestans resistance as described in Example 1 . Only the genetic construct harbouring RGC2-blb was able to com- 
plement the susceptible phenotype; 86% of the R 0 (RGC2-b/to) plants were resistant (Table 3) whereas al \ RGd-blb 
RGC3-blb and RGC4-blb containing primary transformants were completely susceptible to P. infestans. The resistant 
RGC2-blb containingtransformants showed similar resistance phenotypes as the S. bulbocastanum resistant parent 
(Figure 5). RGC2-blbwas therefore designated the Rpi-blb gene, the DNA sequence of which is provided in Figure 6. 

EXAMPLE 1: DISEASE ASSAY 

[0034] The phenotype of S. bulbocastanum and transgenic S. tuberosum genotypes for resistance to P. infestans 
was determined by detached leaf assays. Leaves from plants grown for 6 to 12 weeks in the greenhouse were placed 
in pieces of water-saturated florists foam, approximately 35x4x4 cm, and put in a tray (40 cm width, 60 cm length and 
6 cm height) with a perforated bottom. Each leaf was inoculated with two droplets or more (25 u.l each) of sporanglospore 
solution on the abaxial side. Subsequently, the tray was placed in a plastic bag on top of a tray, in which a water- 
saturated filter paper was placed, and incubated in a climate room at 17°C and a 16h/8h day/night photoperiod with 
fluorescent light (Philips TLD50W/84HF). After 6 days, the leaves were evaluated for the development of P. infestans 
disease symptoms. Plants with leaves that clearly showed sporulating lesions 6 days after inoculation were considered 
to have a susceptible phenotype whereas plants with leaves showing no visible symptoms or necrosis at the side of 
inoculation in the absence of clear spoliation were considered to be resistant. The assay was performed with P. 
infestans complex isolate 655-2A, which was obtained from Plant Research International BV (Wagen.ngen, The Neth- 
erlands). 

EXAMPLE 2: MAPPING OF THE Rpi-blb RESISTANCE LOCUS 
Plant material 

[0035] In order to produce an intraspecific mapping population that segregated for the P. infestans resistance gene 
present in S bulbocastanum accession BGRC 8005 (CGN 17692, PI 275193), a susceptible S. bulbocastanum gen- 
otvpe was required Several S. bulbocastanum accessions originating from different clusters/areas in Mexico were 
analysed for P infestans resistance or susceptibility in a detached leaf assay (Table 1 and Figure 1). In accession 
BGRC 8008 and BGRC 7999 no susceptibility was detected. In accession BGRC 8005, BGRC 8006 and BGRC 7997 
susceptibility was only present in 9%, 7% and 1 4 % of the analysed seedlings, respectively. Thus, only afew susceptible 
S. bulbocastanum genotypes were obtained. 

[0036] The intraspecific mapping population of S. bulbocastanum (B8) was produced by crossing a P. infestans sus- 
ceptible clone of accession BGRC 8006 with a resistant clone of accession BGRC 8005. DNA of 2109 progeny plants 
was extracted from young leaves according to Doyle and Doyle (1 989). 

CAPS marker analysis 

[0037] For PCR analysis, 15 ill reaction mixtures were prepared containing 0.5 ng DNA, 15 ng of each primer, 0.2 
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mM of each dNTP, 0.6 units Taq -polymerase (15 U/u.1, SphaeroQ, Leiden, The Netherlands), 10 mM Tris-HCI pH 9, 
1 .5 mM MgClg, 50 mM KCI, 0.1 % Triton X-1 00 and 0.01 % (w/v) gelatin. The PCRs were performed using the following 
cycle profile: 25 seconds DNA denaturation at 94°C, 30 seconds annealing (see Table 1) and 40 seconds elongation 
at 72°C. As a first step in PCR-amplification DNA was denatured for 5 min at 94°C and finalised by an extra 5 min 
elongation step at 72°C. The amplification reactions were performed in a Biometra® T-Gradient or Biometra® Uno-ll 
thermocycler(Westburg, Leusden, The Netherlands). Depending on the marker, the PCR product was digested with 
an appropriate restriction enzyme. An overview of the markers including primer sequences, annealing temperature 
and restriction enzymes, is given in Table 2. Subsequently, the (digested) PCR products were analysed by electro- 
phoresis in agarose or acrylamide gels. For acrylamide gel analysis, the CleanGei DNA Analysis Kit and DNA Silver 
Staining Kit (Amersham Pharmacia Biotech Benelux, Roosendaal, the Netherlands) were used. 

Genetic mapping of the Rpi-blb locus 

[0038J Initially a small group of 42 progeny plants of the B8 population was screened for resistance to P. infestans 
in a detached leaf assay. Plants with leaves that clearly showed sporulating lesions 6 days after inoculation were 
considered to have a susceptible phenotype whereas plants with leaves showing no visible symptoms or necrosis at 
the side of inoculation in the absence of clear sporulation were considered to be resistant. Of the 42 seedlings, 22 
scored resistant and 1 6 susceptible. The phenotype of the remaining 4 seedlings remained unclear in this initial phase. 
These data indicated that resistance could be due to a single dominant gene or a tightly linked gene cluster. In order 
to determine the chromosome position, seedlings with a reliable phenotype were used for marker analysis. Chromo- 
some 8 marker TG330 was found to be linked in repulsion with the resistant phenotype, as only one recombinant was 
obtained between this marker and resistance in 12 B8 seedlings. Furthermore, chromosome 8 marker CT8B was found 
to be completely linked in repulsion phase to resistance, indicating that a resistance gene was located on chromosome 
8. 

[0039] Subsequently, chromosome 8 specific markers that had been mapped proximal and distal to CT88 (Tanksley 
et al., 1 992) were developed to CAPS markers. In order to map these markers more precisely, another 512 individuals 
of the B8 population were screened for late blight resistance using the detached leaf disease assay. Simultaneously 
plants were scored for the markers CT64, CT88 and TG513. For 5 seedlings, recombination was detected between 
markers CT64 and CT88, while 41 seedlings were recombinant between markers CT88 andTG513 (Figure 2A). The 
resistance gene Rpi-blb was mapped in between markers CT64 and CT88. In this stage, the positioning of CT88 
proximal to Rpi-blb was based on only one recombined seedling. 

[0040] In order to determine the position of Rpi-blb more precisely relative to the available markers, another 1555 
seedlings of the B8 population were grown and analysed for recombination between the markers TG513 and CT64. 
Thus, a total of 2109 individual offspring clones of the B8 population were screened. Recombination between markers 
TG51 3 en CT64 was detected in 21 9 of these seedlings (1 0.4 cM). All of the recombinants were screened with marker 
CT88 and phenotyped for the resistance trait by making use of the detached leaf assay. In agreement with earlier 
results, the Rpi-blb gene was mapped in between markers CT88 and CT64 (Figure 2B). 

EXAMPLE 3: CONSTRUCTION OF A S. BULBOCASTANUM BAC LIBRARY AND CONSTRUCTION OF A 
CONTIGUOUS BAC CONTIG SPANNING THE Rphblb LOCUS 

BAC library construction 

[0041] A resistant clone of S. bulbocastanum (bib) accession BGRC 8005 (CGN 1 7692, PI 2751 93) heterozygous 
for the Rpi-blb locus, was used as source DNA for the construction of a genomic BAC library, hereafter referred to as 
the 8005-8 BAC library. High molecular weight DNA preparation and BAC library construction were carried out as 
described in Rouppe van der Voort et al. (1999). Approximately 130.000 clones with an average insert size of 100 kb, 
which corresponds to 15 genome equivalents were finally obtained. A total of approximately 83.000 individual clones 
were stored in 216 384-well microtiter plates (Invitrogen, The Netherlands) containing LB freezing buffer (36 mM 
K 2 HP0 4 , 13.2 mM KH 2 P0 4 , 1.7 mM citrate, 0.4 mM MgS0 4 , 6.8 mM (NH 4 ) 2 S0 4 , 4.4 % V/V glycerol, 12.5 u.g/ml 
chloramphenicol in LB medium) at -80°C. Another 50.000 clones were stored as bacterial pools containing -1000 
white colonies. These were generated by scraping the colonies from the agar plates into LB medium containing 18% 
glycerol and 12.5 u.g/ml chloramphenicol using a sterile glass spreader. These so-called super pools were also stored 
at -80°C. 

Screening of the BAC library and construction of a physical map of the Rpi-blb locus 

[0042] The 8005-8 BAC library was initially screened with CAPS markers CT88 and CT64. This was carried out as 
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h » 37«C oTe elate wts used to scrape the 24 colonies of each row together and the other plate was used to scrape 

wmamm 

pools carry the prefix SP (e.g jSPeM). BAC clones were analysed by isolating plasmid 

revealed tha t BAC 1 39 earned the CT88 a ^ ^ ^ ^ o( ^ ^ (R) and |eft (L) ds 

of Snslrt end s'eqrncfng was carried out as described in Example 4 using 0.5 N of BAC DNA as template^ 
piomhte CApl marKers S developed by digesting the PGR products of the two parent genotypes of the B8 

BAC clone &FB4 coraain recombinant panel with B149R revealed that this BAC spanned the Bpi-blb 

^^i^Z^^^S^^ by'two recombination events (Figure 3A). Screening of the 8005-8 
t^S^SSSSSZ BAC clone B49 as having the B149R allele that was ^^J^T^T 
BAC ctone together with BAC clone SPB4 therefore formed a BAC contig that spanned the Rprtlb .ocus (F.gure 3). 

EXAMPLE 4: SEQUENCE ANALYSIS OF BAC SPB4 AND IDENTIFICATION OF RESISTANCE GENE CANDIDATES 
WITHIN THE Rpi-blb LOCUS 

r„ n «, xA/ thin ,h B <?PB33L-B1 49R interval resistance cosegregated with BAC end marker SPB42L, the sequence of 

tp !Sn7«n PmsE sontorec r ISO sonicator. After ethanol precipitation and resuspension in 20 pi TE the ends of the DNA 
fra£^ 

polymerase l^^ n "J°*^ J f**" i% seaPlaque LMP agarose gel (FMC). The fraction with a size of 1 .5-2.5 
sequently separated by e.ec t^ojesis or 1 *^ ^ ^ s&d > s|ices were tnen transferred 

T a TsT^ZoT£ m e« Sffi^e^SSS. 1 - - QEuSe (Epicentre Technologies, USA) 
per 1 00 mg oTagarosl^ for ! hr at 45»C. and the DNA was subsequently precipitated. The 1 .5-2.5 Kb fragments 
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were ligated at 1 6°C in a EcoRV restricted and dephosphorylated pBluescript SK + vector (Stratagene Inc.). The ligation 
mixture was subsequently used to transform ElectroMAX E. co/i DH10B competent cells (Life Technologies, UK) by 
electroporation using the BioRad Gene Pulser. Settings on the BioRad Gene Pulser were as recommended for E. co/i 
by the manufacturer. The cells were spread on Luria broth (LB) agar plates containing ampicillin (1 00 u.g/ml), 5-bromo- 
4-chloro-3-indolyl-p-D-galactoside (Xgal) (64 ng/ml) and isopropyl-1-thio-p-D-galactoside (IPTG) (32 u.g/ml). Plates 
were incubated at 37°C for 24 hours. Individual white colonies were grown in 96-well flat-bottom blocks (1 .5 ml Terrific 
Broth medium containing 1 00 ng/ml ampicillin). 

[0046] Plasmid DNA was isolated using the QIAprep 96 Turbo Miniprep system in conjunction with the BioRobot™ 
9600 (QIAGEN) according to the manufacturers instructions. Sequencing reactions were performed using ABI PRISM 
BigDye™ Terminator cycle sequencing kit (Stratagene) according to the manufacturer's instructions. Ail clones were 
sequenced bi-directionally using universal primers. Sequence products were separated by capillary electrophoresis 
on a Perkin Elmer ABI 3700 DNA Analyzer. 

[0047] The automated assembly of the shotgun reads was carried out using the Phred-Phrap programs (Ewing and 
Green, 1998; Ewing et at. t 1 998). A total of 835 reads provided an overall BAC sequence coverage equal to 5x. Gaps 
between contigs were closed by primer walking or through a combinatorial PCR approach. The sequence was finally 
edited at Phred quality 40 (1 error every 1 0,000 nt) by manual inspection of the assembly using the Gap4 contig editor 
and re-sequencing of all low-quality regions. The complete sequence of the insert of BAC SPB4 consisted of 77,283 
nucleotides. 

[0048] Analysis of the contiguous sequence of BAC SPB4 using the computer programme GENSCAN (Burge and 
Karlin, 1997), GENEMARK (Lukashin and Borodovsky, 1998) and BLASTX (Altschul et a/., 1990) identified four com- 
plete /?gene candidate sequences {RGCI-blb, RGC2-b/b, RGC3-b/b and RGC4-b/b) belonging to the NBS-LRR class 
of plant R genes. A CAPS marker designed in between RGC1-blb and RGC4-blb, marker RGC1-4 revealed recombi- 
nation between P infestans resistance and RGC4-blb, ruling put the possibility of RGC4-b/b being Rphbtb (Figure 3A 
and B). Despite this finding, all four RGCs were selected for complementation analysis. 

EXAMPLE 5: COMPLEMENTATION ANALYSIS 

Subcloning of candidate genes and transformation to Agrobacterlum tumefaciens 

[0049] Genomic fragments of approximately 10 kb harbouring RGC1-bib, RGC2-btb, RGC3-blb or RGC4-b/b were 
subcloned from BAC clone SPB4 into the binary plant transformation vector pBINPLUS (van Engelen et al., 1995). 
Restriction enzyme digestion of BAC clone SPB4 DNA and subsequent size selection was carried out as follows. 
Aliquots of ~1 ng DNA were digested with 1U, 0.1 U or 0.01 U of Sau3M restriction enzym for 30 min. The partially 
digested BAC DNA was subjected to CHEF electrophoresis at 4°C in 0.5 X TBE using a linear increasing pulse time 
of 1-10 sec and a field strength of 6 V/cm for 1 6 hr. After electrophoresis, the agarose gel was stained with ethidium 
bromide to locate the region of the gel containing DNA fragments of approximately 1 0kb in size. This region was excised 
from the gel using a glass coverslip and dialysed against 50 ml TE for 2 hr at 4°C. Dialysed agarose slices were then 
transferred to a 1.5 ml Eppendorf tube, melted at 70°C for 5 min and digested with 1 unit of GELASE (Epicentre 
Technologies, USA) per 1 00 mg of agarose gel for 1 hr at 45°C. Ligation of the size selected DNA to BamH I -digested 
and dephosphorylated pBINPLUS and subsequent transformation of ElectroMAX E. coli DH10B competent cells (Life 
Technologies, UK) with the ligated DNA was carried as described in Example 5, using the BioRad Gene Pulser for 
electroporation. The cells were spread on Luria broth (LB) agar plates containing kanamycin (50 u,g/ml), Xgal (64 ng/ 
ml) and IPTG (32 u.g/ml). Plates were incubated at 37°C for 24 hours. Individual white colonies were grown in 96-well 
plates (1 00 uJ LB medium containing 50 u.g/ml kanamycin). A total of 480 clones were PCR screenep* for the presence 
of RGCs using primers SPB42LF and SPB42LR or RGC4F and RGC4R (Table 2.). Positive clones were selected for 
plasmid isolation and further characterisation. Identification of clones harbouring RGCI-blb, RGC2-b/b r RGC3-blb or 
RGC4-blb was carried out by sequencing the SPB42L PCR fragments derived from positive clones. The relative position 
of the RGCs within a subclone was determined by sequencing the ends of the clone and subsequent comparison of 
the sequences to the complete BAC insert sequence. Finally four binary plasmids, pRGC1 -bib, pRGC2-blb, pRGC3-blb 
and pRGC4-blb were selected and transferred to Agrobacterium tumefaciens strains AGL0 (Lazo eta/., 1 991 ), LBA4404 
(Hoekema et a/., 1983) or UIA143 (Farrand et a/., 1989) either by electroporation using the BioRad Gene Pulser or by 
conjugation. Settings on the BioRad Gene Pulser were as recommended for A. tumefaciens by the manufacturer. 
Conjugation was carried out as described by Simon eta/. (1 983). The cells were spread on Luria broth (LB) agar plates 
containing kanamycin (100 mg/l) and rifampicin (50 mg/1). Plates were incubated at 28°C for 48 hours. Small-scale 
cultures from selected colonies were grown in LB medium containing kanamycin (100 mg/l) and rifampicin (50 mg/l). 
Plasmid DNA was isolated as described previously and the integrity of the plasmids was verified by restriction analysis 
upon reisolation from A. tumefaciens and subsequent transformation to E. co/i. A tumefaciens cultures harbouring a 
plasmid with the correct DNA pattern were used to transform a susceptible potato genotype. 
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Transformation of susceptible potato cultivar 

[0050] A tumefaciens strains were grown for 2 days at 28°C in 20 ml LB medium supplemented with SO mg/l rifampi- 
cine and 25 mg/l kanamycin. Subsequentiy, 0.2 mi of A tumefaciens culture was diluted in 1 0 ml LB medium contaWng 
me same antibiotics and grown overnight (28'C). The overnight cu.ture was centnf uged (30 i m,n 2 47 x g) and the 
pellet was resuspended in 50 ml MS medium (Murashige and Skoog, 1 962) supplemented with 30 g/1 sucrose (MS30V 
ro051l Certified seed potatoes of cultivar Impala were peeled and surface sterilised for 30 m.n. in a 1% sodium 
Ro^Tution containing 0.1 % Tween-20. Tubers were then washed thoroughly in large volumes of sterrie 
Sled water (4 times, 10 min). Discs of approximately 2 mm thickness and 7 mm in diameter, were sliced from 
SdersoTtubertissueprepared^ 

A tumefaciens and incubated for 15 min. After removing the A tumefaciens solution the "^"^J™^ 
to regeneration medium containing MS30, 0.9 mg/l IAA, 3.6 mg/l zeatine ribos.de and 8 g/l agar (Hoekema et al 1 989). 
TheTates were incubated at 24°C, 1 6 hour day-length (Philips TLD50W/84HF). After 48 hours of co-culhvaton, the 
Scs we re rinsed for 5 min in liquid MS medium including antibiotics, 200 mg/l vancomycin, 250 mg/l cefotax.m 
and 7 fm jl kanamycin. and transferred to regeneration medium supplemented with the same antibiotics. The plates 
wereTncuLed at 2>C 16 hour day-length (Philips TLD50W/84HF). Every three weeks, the tuber discs were trana- 
WnX resh medium. Regenerating shoo^ 

shoots were propagated in vitro and tested for absence of A. tumefaciens cells by incubating a p.ece of stem in 3 ml 
LB medium (3 weeks, 37°C. 400 rpm). One plant of each transformed regenerant was transferred to the greenhouse. 

Complementation of the susceptible phenotype In potato 

10052] Primary transformants were tested for P. infestans resistance as described J Example 1 Only the , genetic 
construct harbouring RGC2-bib was able to complement the suscepbble phenotype; 86% of the ^RGC2-blb plants 
were resistant (Table 3) whereas all RGCLblb, RGC3-blb and RGC4-blb containing primary transformants were com- 
X except ble to P. infestans. The resistant RGC2-b,b transformants showed similar resistance phenotypes as the 
SKSlreslstantparent(Rgure 5). RGCS-b/bwas therefore designated the ^-b/bgene.theDNAsequence 
of which is provided in Figure 6. 

Transformation of susceptible tomato 

[0053] Seeds of the susceptible tomato line Moneymaker were rinsed in 70% ethanol to dissolve the seed coat and 
washed with sterile water. Subsequently, the seeds were surface-sterilised in 1 .5% sodium hypochlonte for 15 m.nutes 
rinsedihree times in sterile water and placed in containers containing 140 ml MS medium pH f ° and 
Skoog, 1962) supplemented with 1 0 g/l sucrose (MS10) and 160 ml vermiculite. The seeds were left to gerrrrina e Tor 
8 days at 25°C and 0.5 W/M* light. Eight day old cotyledon explants were pre-cultured for 24 hours inPetn d.shes 
containina a two week old feeder layer of tobacco suspension cells plated on co-cult.vation med.um (MS30 pH 5.8 
su^nferwith Nitsch vitamines louchefa Biochemie BV. Haarfem. The Netherlands), 0.5 g/l MES buffer and 8 g/ 

rooSr Overnight cultures of A. tumefaciens were centrifuged and the pellet was resuspended in cel. suspension 
medium (MS30 P H 5 8 supplemented with Nitsch vitamines, 0.5 g/l MES buffer, pH 5.8) conta.n.ng 200 u.M acetosy- 
m£n?ta fTnaf oleoo o'f 0.25. The explants were then infected with the diluted overnight cufture ofA.tumefaaens 
stTn UIA143 (Farran^et a.., 1989) containing the helper plasmid pCH32 (Hamilton et al„ 1996) and pRGC2-blb for 
25 minutes, blotted dry on sterile filter paper and co-cultured for 48 hours on the original feeder layer plates. Culture 

^^"FoTowing S^ScStSvaTon, the cotyledons explants were transferred to Petri dishes with se ^ctive shoot 
nducing medium (MS pH 5.8 supplemented with 10 g/l glucose, including Nitsch vitamines 0 5 <^^* S J* 
agargel 2 mg/l zeatine riboside, 400 mg/l carbenicilline, 100 mg/l kanamyc.ne, 0.1 mg/l IAA) and cultured at 25 C with 
3 5 W/m2 light The explants were sub-cultured every 3 weeks onto fresh medium. Emerging shoots were dissected 
from the underlying callus and transferred to containers with selective root inducing medium (MS10 pH 5.8 1 supple- 
mented with NKsch vitamines, 0.5 g/l MES buffer. 5 g/l agargel, 0.25 mg/l IBA, 200 mg/l carbenic.ll.n and 100 mg/l 
kanamycine). 

Complementation of the susceptible phenotype in tomato 

rO0561 Primary transformants were tested for P. infestans resistance essentially as described in Example 1 for potato 
eaves except that a P. infestans isolate was used that is specific for tomato. The tomato isolate was obtained from 
Plant Research International BV (Wageningen, The Netherlands). In total 10 transformants containing an intact 
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RGC2-blb construct were tested for resistance to P infected tk ~, 

able to complement a susceptible tomato pheno^e 86386 r68,StanCe 3SSay revea,ed W-Ub is 

Molecular analysis of primary transformants 
RT-PCR analysis 

WOGOTACSTAACGGCATGAOAdTCm,,^iol«n« JE£V! 0,00 <m "*"»• «=TGTCAAci A . 
M 1 Ml «. .ransertp.ase (M-MW ^iS-TtZSSSi P,1L « ^T'™* "* wa * « «*<= 

Rap/d amplification of cDNA ends 

CATCTCCCAGAG) and GSP2 (S^TGCTT^^ (S'-GAGgUc 9 

EXAMPLE 6: STRUCTURE OF THE GENE AND THE CORRESPONDING PROTEIN. 

[0059] T| iesizeandstructureofthe/?p/-6/6qenewasdetermin 0 r(K„^ 

the insert of pRGC2-blb with cDNA fragments generated by JT£S ST" 3 ? 9en ° mi ° SeqUenCe derived f ™ 
6' and 3' Rpi-blb specific cDNA fragments of a sinq!e soedes resfJLT a ™P"«cat,on ofcDNA ends. RACE identified 
a single *M* specific transcript The cod nc Sue™ 2 'iTES? SU " eStin 9 that the 9*"°™ clone encodes 
» (nt). The gene contains a single intron S^^^SX* "° be 291 0 "decides 

gene (Figure 3C). starting 428 nt after the translational ATG start codon of the 

[0060] The deduced open readinq frame of tho «r>; h;*> „„„„ 

with an estimated molecular weight of 1 ToZkD (RqureB) SeveratT ^ a predic,ed P°'ypeptide of 970 amino acids 
class of plant R genes are apparent in IheencoZ^L ^T fUn ?' Ona ' m0,ifs Wt in Rgenes of the NBS-LRR 

« 8). The N-termina, par, of the' protein oon^p^ ' n, ° 3 d °™ inS < A - B a "« * Figure 

residues are general^ hydrophobic (domain A) ^ma?n B^aIt. »K re Peats in which the first and fourth 

NB-ARC domain (ARC for Apaf-1 . R p^efn and of R rS" ^ ^ ° ther m ° Ws that constitute «"e 

Biezen and Jones, 1998). This domain incfudes the Ap ATPaLe mSTo^ r T * (Va " der 

origin (Aravind et a/., 1999). The C-terminal half of Rri Tblb^w. P " Pr ° temS ° ,eul «ryotic a nd P rokaryotic 

« LRRs can be aiigned according ,o the eoSl^^^^SiS^ f° p 9 "'" LRRS (d0mai " C) The 
res.due and "a" designates the positions of aliphatic aTno aS ^ Slo^d^ll^ WhW8 * desi 9" a <e* ™Y 

format approximates the consensus for cytoplasmic LRRs (Jones SSSiSfT ^ ^ ^ 

^ EXAMPLE 7: NATURAL HOMOLOGUES AND ARTIFICIAL VARIANTS OF THE R phblb GENE 
Natural homologues 

[0061 ] BLASTX homology searches with the codinq seouenee of ih» 

homology with various state of the art genes does no^ceeo 36* *ZZ ^ ««»«nce 
*> score was obtained with an NBS-LRR gene derived X ^STSL?"?' (Tab ' e ^ ^ b6S ' BLASTX 
genes sharing an overall sequence homology of 27-36%^m7o j£l 2 T * set > uence ia ^). NBS-LRR 

others in Arabidopsis thaliana. P/rasec/us^u/g^ 

1997;Simonse,al. 1 998), Zea mays, *£^SL^?Sit?SSrS^tT^5r'. <JCi ~**" 9ene «*«<*: Ori et a. 
acid sequences of RGCf-b/*, flGcSSTKSSS E^T' °' d6dUCed ami "° 

» (as defined by BLASTX) derived Irom diverse species us^'he Ne^l > "°™«>90us state of the art genes 
shows that members of the Rpi-blb gene cluster can hi S Ne ' 9hbour - Jo,nln 9 ™thod of Saitou and Nei (1987), 
[0062] Sequence comparisons of ? h "our R gen cand SsTd! TT* 1*"°* 9) 
qoence homology within trie gen e cJ^^ISS 70% and' eTy a, me ^ ^ SPB4 Sh ° W that ^ 

een /u /o and 81 /o at the Q^'no acid sequence level. The 
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acid sequence identity) than the C-term.na l halves acid sequences revea led a total of 

mains (LRRs; 71% amino acid sequence o{ thesc are , 0 cated in the LRR region (80/104). 

T04 RpVbtb specific amino acid residues (F.gure 10) Jhe * subdoma in xxLxLxxxx. The relat.ve fre- 

the .alter region, these specific residues ^^^^^main ?s more than two times higher (28.3%) than 
Tency of these specific amino-acid res.dues w, h-n th s ^R R su^n ^ ^ ^ ed 

Lt observed in the ^J^^^^'^^ ^ "* * 
'^^r^s T r r -J^— - * screening genomic DNA or 

[00631 Sequences of additional homologues ol ^the Pp ^ ge ^ C uenceso(the flp ^ Wb gene. Screening of vanous 
nsert ibraries. e.g. BAG .ibraries with pnmers ; based on sign - W , 
Sofanum BAG libraries with primer sets A ' ^LL sequences with those presented in F.gure 10 

derived from different Solanum spec.es. Al, 9 nmen * ™ S6 ' *° sjdues tnere in responsible for P. infestans res.stance 
S help identify RpiM *»?°»»^^ Poetic tree analyses.^, us.ng 

Rpi-blb gene cluster. 
Artificial variants 

[0064] Ooma,nswapsbe W eenthe« 

„ P. niestans resistance. The restrict ^ ete LRR domain of Bpl-m with that of RGCf-Wb 
present in the conserved MHDmot.f ^^^^L, variants of the Rp/-Wb gene were made wh,ch 
or RGC3-Wt> using techniques known to those stalled in tnean^ RGC3-blb and visa versa, i.e., the N- 

encSe^he N-termina. haif of Rpi-blb and the C- emn.na haj of I ^ Thesevanants we re transformed 
Sna.ha«ofR G C1.b.borRGC3-b.bandthe^ 

by this part of the gene. 



Table 1 . 



35 



40 



45 



S. buibocastanum accession 



CGN 



17692 

17693 
17687 
17688 



BGRC 



275193 
275194 
275198 
243505 
255518 



plants 



susceptibi lity 



cluster 3 



a The .e«i a. b and = represent re.ative geographic* origins depicted .n Figure ■ 



50 



55 
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Table 2. Overview of markers used for mapping Rpi-blb 



Marker Ori» Sequence b 



Annealing Restriction 
temp (°C) enzyme c 



TG513 



TG330 



CT88 



CT64 



B139R 



SPB33L 



SPB42L 



B149R 



RGCl-4 



B 



F CGTAAACGCACCAAAAGCAG 58 71. 

R GATTCAAGCCAGGAACCGAG 

F CAGCTGCCACAGCTCAAGC 56 TaqI 

R TACCTACATGTACAGTACTGC 

F GGCAGAAGAGCTAGGAAGAG 57 Mbol 

R ATGGCGTGATACAATCCGAG 

F TTCAAGAGCTTGAAGACATAACA 60 a.s. 

R ATGGCGTGATACAATCCGAG 

F ACTAGAGGATAGATTCTTGG 56 Cfol 

R CTGGATGCCTTTCTCTATGT 

F GATCAGAAGTGCCTTGAACC 56 TaqI 

R CAAGGAGCTTGGTCAGCAG 

F ATTGCACAGGAGCAGATCTG 59 Hinfl 

R TGTAAG AGAG CAAG AGG C AC 

F AGAGCAGTCTTGAAGGTTGG 58 Cfol 

R GATGGTAACTAAGCCTCAGG 

F GACAGATTTCTCATAAACCTGC 58 Msel/Xbal 

R AATCGTGCATCACTAGAGCG 

F TGTGGAGTAAGAGAGGAAGG 62 Sspl/Msel 

R TCAGCTGAGCAGTGTGTGG 
F ATGG CTGAAG CTTTC ATTCAAGTT 60 
CTG 

R TCACACCGCTTGATCAGTTGTGGA 
C 

F TRCATGAYCTMATCCATGATTTGC 60 
R GMAATTTTGTGCCAGTCTTCTCC 



• Orientation of the primer, F: forward, R: reverse 
b primer sequences according to IUB codes 
e a.s.: allele specific. 
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Table 3. 



Phytophthora infestans resistance assays 



Genotype 3 



RGC-containing plants/ 
transformants 



R plants / RGC-containing plants 



R 0 (RGC1-blb) 
R 0 (RGC2-blb) 
Ro(RGC3- 
blb) 

R 0 (RGC4-blb) 



15/20 b 
7/31 c 
0/6 c 

14/21 d 
1/7 C 

18/19 b 



0/15 
6/7 

0/14 
0/1 

0/18 



a Rn oenotvpes are primary transformants obtained from transformation of the suscept 
thX^- -ndidates RGC1-blb, RGC2-t> lb . RGC3-t>lt>or «^ 
aid tor transformation of the P. infestans susceptible potato cultrvar Impala. Kan". K 



eotible potato cultivar Impala with aT-DNA constructs containing 
Kanamycin resistant. 



Table 4 



Comparison of nucleotide and amino acid sequence homology 



Rpi- 
bld 



nt a 



aa , 



8005-8 BAC SPB4 



RGC3- bib 



88 



81 



N b 
91 



Cb 
71 



RGC1-Wd 



84 



76 



N 
79 



C 
72 



RGC4-6W 



81 



70 



N 

75 



C 
66 



Rice 
RGC 



Arabidops 
is RGC 



37 



32 



Tomato 
I2C-1 



32 



a Percental nu^ 
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SEQUENCE LISTING 

110> Kweek- en researchbedrijf Agrico B.v. 
<120> The invention relates to the field of plant diseases 
<130> P58542EPOO 



<140> 0207S565.8 
<141> 2002-02-08 

<160> 44 

<170> Patentin ver. 2.1 

<210> 1 

<211> 9 

<212> PRT 

<213> Artificial sequence 



<223> Description of Artificial sequence: concentration 
in LRR subdomain 

<220> 

<221> SITE 

<223> /note= ,( x stands for any amino acid" 

xaa°xaa Leu xaa Leu xaa xaa xaa xaa 
1 5 

<210> 2 
<2U> 10 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial sequence: consensus 

<220> 

<221> SITE 

<223> /note-V stands for any amino acid" 
X^a°xaa Leu xaa xaa Leu Xaa xaa xaa xaa 

<210> 3 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: oligo d(T) 
primer 

<220> 

<221> misc_feature 
<222> (1)..C54) 

gc?gtcaacg atacgctacg taacggcatg acagtgtttt tttttttttt tttt 
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<210> 4 
<211> 20 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer GSPl 
<220> 

<221> mi sc_f eature 
<222> (1)..(20) 

<400> 4 

gaggaatcca tctcccagag 

<210> 5 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer GSP2 
<220> 

<221> misc_feature 
<222> (1) . .(27) 

<400> 5 

gtgcttgaag agatgataat tcacgag 



<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: primer GSP3 
<220> 

<221> mi sc_f eature 
<222> (1). .(21) 

<400> 6 

gtccatctca ccaagtagtg g 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer GSP4 
<220> 

<221> misc_feature 
<222> (1). .(22) 

<400> 7 

gaaatgctca gtaactctct gg 



<210> 8 
<211> 20 
<212> DNA 
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X 



<213> Artificial Sequence 

<223> Description of Artificial Sequence: primer GSP5 
<220> 

<221> misc_feature 
<222> (D..C20) 

<400> 8 

ggaggactga aaggtgttgg 

<210> 9 
<211> 22 
<212> PRT 

<213> Artificial sequence 

^223> Description of Artificial Sequence: consensus 
<220> 

<221> SITE 

^n'po^tion 12 stands. for C/N or S X on position 
22 stands for aliphatic amino acid 

Leu°xaa xaa Leu xaa Xaa Leu xaa Leu xaa Xaa Xaa Xaa Xaa Leu Xaa 

1 5 
Xaa Leu Pro Xaa xaa Xaa 

<210> 10 

<211> 6 

<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Nsil-site 
<220> 

<221> misc_feature 
<222> (1)..(6) 

<400> 10 
atgcat 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<M3> Description of Artificial Sequence: forward primer 

<220> 

<221> misc_feature 
<222> (1)..(20) 

<400> 11 

cgtaaacgca ccaaaagcag 
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<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> mi sc_f eature 
<222> (1)..(20) 

<400> 12 

gattcaagcc aggaaccgag 



<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> mi sc_f eature 
<222> (1). .(19) 

<400> 13 

cagctgccac agctcaagc 



<210> 14 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> mi sc_f eature 

<222> CD.. (21) 

<400> 14 

tacctacatg tacagtactg c 



<210> 15 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> mi sc_f eature 

<222> (1)..(20) 

<400> 15 

ggcagaagag ctaggaagag 

<210> 16 
<211> 20 
<212> DNA 
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<213> Artificial sequence 

: 2 223> Description of Artificial sequence: reverse primer 

<220> 

<2 21> mi scfeature 
<222> (D..C20) 

<400> 16 

atggcgtgat acaatccgag 

<210> 17 

<211> 23 

<212> ONA . 

<213> Artificial sequence 

tilt Description of Artificial Sequence: forward primer 

<220> 

<221> miscfeature 
<222> CD • • (23) 

<400> 17 

ttcaagagct tgaagacata aca 

<210> 18 
<211> 20 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> misc_feature 
<222> CD - -(20) 

<400> 18 

atggcgtgat acaatccgag 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> misc^feature 
<222> CD.- (20) 

<400> 19 

actagaggat agattcttgg 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

^223> Description of Artificial Sequence: reverse primer 
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<220> 

<221> misc_/feature 
<222> (1)..(20) 

<400> 20 

ctggatgcct ttctctatgt 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> misc_feature 
<222> (1) . . (20) 

<400> 21 

gatcagaagt gccttgaacc 



<210> 22 
<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: reverse primer 
<220> 

<221> mi sc__f eature 
<222> (1) . .(19) 

<400> 22 

caaggagctt ggtcagcag 



<210> 23 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> misc_feature 
<222> (1). . (20) 

<400> 23 

attgcacagg agcagatctg 



<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> misc_feature 
<222> (1). .(20) 
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<400> 24 

tgtaagagag caagaggcac 



<210> 2S 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 



<l\T> Description of Artificial Sequence: forward primer 



<220> 

<221> misc_feature 
<222> CD-.C20) 

<400> 25 

agagcagtct tgaaggttgg 



<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



^223> Description of Artificial Sequence: reverse primer 



<220> 

<221> misc_feature 
<222> (1)..(20) 

<400> 26 

gatggtaact aagcctcagg 



<210> 27 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
c220> 



<223> Description of Artificial sequence: forward primer 



<220> 

<221> misc^feature 
<222> (D-.C22) 

<400> 27 ^ 
gacagatttc tcataaacct gc 



<210> 28 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 



<223> Description of Artificial Sequence: reverse pnmer 
<220> 

<221> misc_feature 
<222> (X)-.(20) 

<400> 28 

aatcgtgcat cactagagcg 
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<210> 29 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<221> misc_feature 
<222> (1)..(20) 

<400> 29 

tgtggagtaa gagaggaagg 

<210> 30 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> misc_feature 
<222> (1)..(19) 

<400> 30 

tcagctgagc agtgtgtgg 

<210> 31 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: forward primer 
<220> 

<22l> misc^feature 
<222> (1)..(27) 

<400> 31 

atggctgaag ctttcattca agttctg 



<210> 32 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse primer 
<220> 

<221> misc^feature 
<222> (1)..(25) 

<400> 32 

tcacaccgct tgatcagttg tggac 

<210> 33 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 

lilt Description of Artificial Sequence: forward primer 
<220> 

<221> miscfeature 
<222> CD*. (24) 

<400> 33 ^ _ 24 

trcatgayct matccatgat ttgc 

<210> 34 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

: 2 223> Description of Artificial Sequence: reverse primer 
<220> 

<221> misc_feature 
<222> CD-- (23) 

<400> 34 23 
gmaattttgt gccagtcttc tec 

<210> 35 
<211> 2913 
<212> DNA 

<213> Solanum bul bocastanum 
<220> 

<221> misc_feature 
<222> CD..C2913) 
<223> /note= Rpi-blb 

» sssss FIIH iiii ilii IsISIHl lis 

attcaagecg tccttgaaga tg«caggag aagcaactca |«acaagc tgaa tataaa 240 
tggttgcaal aactcaatgc tgctacatat 9aagtcgatg acaxcTxgg a ct 300 

alcaaggeca caagattctc ccagtctgaa tatggccgtt ^atcc^ 9^ 3 go 

ttccgtcaca aggtegggaa aaggaxggac "*«^«"| q a qagacaagc tgttagacgg 420 
gaggaaagaa agaattttca tttgcacgaa aaaaxtgrag agag » giaagatgag 480 
gaalcaggtt ctgtattaac cgaaccgcag ftttatggaa Qagacaaag g y ^ ^ 
atagtgiaaa tcctaataaa caatgttagt Qatgcccaac accrxtcagx £ 600 
cttggtatgg 99ggattagg aaaaacgact "tgcccaaa tggtcttw. t 9 atga « aag 56 o 
grtactgagc atttccattc "aaatatgg ««g g c |?tacttgg tgagatggac 720 

sssse asss a ss««ss ssass s&ss 

ttagatgatg tttggaatga agatcaacag aagtgggcta attxaag g ay « att 900 
gttlgalcaa Qtggtgettc tgttctaacc actactcgtc ttgaaaagg t 99 gttgttg ^0 

SSKSS 9 9 t 9 - a SSKS SK238 8SS53SS 

SS8K3 3SSS8 I &«Aj8 ggggs ggg «g 

ttgcctcaag atgaaagttc jattctgect gecexgagg y c aaaatggaa 1260 

cttgatttga aacaatgett tgcgtattgx 9cggj-ytuv. ttttatcaaa aggaaacatg 1320 
aaagaaaagc taatctctct ctggatggcg "tggttttc ««™™L ttttttccaa 1380 
gagctagagg atgtgggcga ^gaagtatgg aaagaattat acttgagg^ c atttg 1440 
Sagattgaag ttaaagatgg ta aaacttax ^aagaugt tc £ g tgaaat aaataaacac 1S00 
gcaacatctc tgttttcagc aaacacatca agcagcaata tccg 9 ca ctcttccc 1560 
Igttacacac atatgatgtc cattggtttc accgaagtgg c attt aataag 1620 

ftacclfcxx 2S?S£g. fcxagfafa? fglacctfta tggcagtggc 1680 
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atgcgtagtc ttccaaagca gttatgcaag 
tattgcacca agctttgttg tttgccaaaa 
cttttacttg atggtagcca gtcattgact 
tgccttaaga ctctaggtca atttgttgtt 
ctaggaaacc taaatctcta tggctcaatt 
gataaggacg caaaagaagc caatttatct 
agttggaata actttggacc acatatatat 
ctcaaaccac actccaatct gacttcttta 
ccagagtgga tgaatcactc agtattgaaa 
agaaactgct catgcttacc accctttggt 
cactgggggt ctgcggatgt ggagtatgtt 
ttccccacaa gaataaggtt tccatccttg 
ctgaaaggat tgctgaaaaa ggaaggagaa 
attcacgagt gcccttttct gaccctttct 
atttgctata ataaagtagc tacttcattc 
ctcaaatact tgacaatctc tcggtgcaat 
agtctgaatg ctttgaaaag tctaaaaatt 
gaggaagggc tggaaggttt atcttcactc 
ctaaaatgtt taccagaggg attgcagcac 
ggatgtccac aactgatcaa gcggtgtgag 
tctcacattc ctaatgtgaa tatatatatt 



cttcaaaatc tgcaaactct rgatctacaa 1740 
gaaacaagta aacttggtag tctccgaaat 1800 
tgtatgccac caaggatagg atcattgaca 1860 
ggaaggaaga aaggttatca acttggtgaa 1920 
aaaatctcgc atcttgagag agtgaagaat 1980 
gcaaaaggga atctgcattc tttaagcatg 2040 
gaatcagaag aagttaaagt gcttgaagcc 2100 
aaaatctatg gcttcagagg aatccatctc 2160 
aatattgtct ctattctaat tagcaacttc 2220 
gatctgcctt gtctagaaag tctagagtta 2280 
gaagaagtgg atattgatgt tcattctgga 2340 
aggaaacttg atatatggga ctttggtagt 2400 
gagcaattcc ctgtgcttga agagatgata 2460 
tctaatctta gggctcttac ttccctcaga 2520 
ccagaagaga tgttcaaaaa, ccttgcaaat 2580 
aatctcaaag agctgcctac cagcttggct 2640 
caattgtgtt gcgcactaga gagtctccct 2700 
acagagttat ttgttgaaca ctgtaacatg 2760 
ctaacaaccc tcacaagttt aaaaattcgg 2820 
aagggaatag gagaagactg gcacaaaatt 2880 
taa ~ 2913 



<210> 36 
<211> 3591 
<212> DNA 

<213> Solanum bul bocastanum 
<220> 

<221> mi sc_feature 
<222> (1). .(3591) 

<223> /note="Rpi -bib including intron sequence (position 
428-1106)" 

<400> 36 

atggctgaag ctttcattca agttctgcta gacaatctca cttctttcct caaaggggaa 60 
cttgtattgc ttttcggttt tcaagatgag ttccaaaggc tttcaagcat gttttctaca 120 
attcaagccg tccttgaaga tgctcaggag aagcaactca acaacaagcc tctagaaaat 180 
tggttgcaaa aactcaatgc tgctacatat gaagtcgatg acatcttgga tgaatataaa 240 
accaaggcca caagattctc ccagtctgaa tatggccgtt atcatccaaa ggttatccct 300 
ttccgtcaca aggtcgggaa aaggatggac caagtgatga aaaaactaaa ggcaattgct 360 
gaggaaagaa agaattttca tttgcacgaa aaaattgtag agagacaagc tgttagacgg 420 
gaaacaggta ctcatcttaa attagtatta caacaactaa gtttatattc atttttttgg 480 
caattatcaa attcagaaaa gggttaaata tactcatgtc ctatcgtaaa tagtgtatat 540 
atacctctcg ttgtactttc gatctgaata tacttgtcaa atctggcaag ctcagaatca 600 
aattatccac cccaactttt aaatactcga tatctttaga aatccacctg tctaactcat 660 
ccactaccca ttccctttgc tttgaattct tttctttacc tataaacttg gaacactcga 720 
tccgttttgc ttttcttaac aaagcagctc agagaaaaga ggttttcttc tattctgttt 780 
ctctgtgtgc tgcacttggg tccttaatcc cattaaaaac agggcatgtt aatcccaacg 840 
acggtagcct ttcctgacag ctgactgtaa attttgtcta acaaagaaaa aaaaagatta 900 
gacatgtttt tccttgtcat tgattaggct ggatttcttt cagagtggaa cataggggat 960 
atattggacc aaaagtagaa tgggtatata tttaaagtat ttctgataga acaggagtat 1020 
attgtgcgaa aatatcctct attttctgtt gtctcctaat gagtttgaat gtaataatat 1080 
tctcatgtgg acattgcttg caccaggttc tgtattaacc gaaccgcagg tttatggaag 1140 
agacaaagag aaagatgaga tagtgaaaat cctaataaac aatgttagtg atgcccaaca 1200 
cctttcagtc ctcccaatac ttggtatggg gggattagga aaaacgactc ttgcccaaat 1260 
ggtcttcaat gaccagagag ttactgagca tttccattcc aaaatatgga tttgtgtctc 1320 
ggaagatttt gatgagaaga ggttaataaa ggcaattgta gaatctattg aaggaaggcc 1380 
actacttggt gagatggact tggctccact tcaaaagaag cttcaggagt tgctgaatgg 1440 
aaaaagatac ttgcttgtct tagatgatgt ttggaatgaa gatcaacaga agtgggctaa 1500 
tttaagagca gtcttgaagg ttggagcaag tggtgcttct gttctaacca ctactcgtct 1560 
tgaaaaggtt ggatcaatta tgggaacatt gcaaccatat gaactgtcaa atctgtctca 1620 
agaagattgt tggttgttgt tcatgcaacg tgcatttgga caccaagaag aaataaatcc 1680 
aaaccttgtg gcaatcggaa aggagattgt gaaaaaaagt ggtggtgtgc ctctagcagc 1740 
caaaactctt ggaggtattt tgtgcttcaa gagagaagaa agagcatggg aacatgtgag 1800 
agacagtccg atttggaatt tgcctcaaga tgaaagttct attctgcctg ccctgaggct 1860 
tagttaccat caacttccac ttgatttgaa acaatgcttt gcgtattgtg cggtgttccc 1920 
aaaggatgcc aaaatggaaa aagaaaagct aatctctctc tggatggcgc atggttttct 1980 
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tttatcaaaa 

cttgaggtct 

tgatctcatc 

ccgtgaaata 

gtttttttac 

tgattcgaca 

gaacctgtat 

gcaaactctt 

acttggtagt 

aaggatagga 

aggttatcaa 

tcttgagaga 

tctgcattct 

agttaaagtg 

cttcagagga 

tattctaatt 

tctagaaagt 

tattgatgtt 

tatatgggac 

tgtgcttgaa 

ggctcttact 

gttcaaaaac 

gctgcctacc 

cgcactagag 

tgttgaacac 

cacaagttta 

agaagactgg 



ggaaacatgg 

tttttccaag 

catgatttgg 

aataaacaca 

actcttcccc 

tttaataagt 

ggcagtggca 

gatctacaat 

ctccgaaatc 

tcattgacat 

cttggtgaac 

gtgaagaatg 

ttaagcatga 

cttgaagccc 

atccatctcc 

agcaacttca 

ctagagttac 

cattctggat 

tttggtagtc 

gagatgataa 

tccctcagaa 

cttgcaaatc 

agcttggcta 

agtctccctg 

tgtaacatgc 

aaaattcggg 

cacaaaattt 



agctagagga 

agattgaagt 

caacatctct 

gttacacaca 

ccttggaaaa 

taccatcttc 

tgcgtagtct 

attgcaccaa 

ttttacttga 

gccttaagac 

taggaaacct 

ataaggacgc 

gttggaataa 

tcaaaccaca 

cagagtggat 

gaaactgctc 

actgggggtc 

tccccacaag 

tgaaaggatt 

ttcacgagtg 

tttgctataa 

tcaaatactt 

gtctgaatgc 

aggaagggct 

taaaatgttt 

gatgtccaca 

ctcacattcc 



tgtgggcgat 

taaagatggt 

gttttcagca 

tatgatgtcc 

gtttatctcg 

cattggagat 

tccaaagcag 

gctttgttgt 

tggtagccag 

tctaggtcaa 

aaatctctat 

aaaagaagcc 

ctttggacca 

ctccaatctg 

gaatcactca 

atgcttacca 

tgcggatgtg 

aataaggttt 

gctgaaaaag 

cccttttctg 

taaagtagct 

gacaatctct 

tttgaaaagt 

ggaaggttta 

accagaggga 

actgatcaag 

taatgtgaat 



gaagtatgga 
aaaacttatt 
aacacatcaa 
attggtttcg 
ttaagagtgc 
ctagtacatt 
ttatgcaagc 
ttgccaaaag 
tcattgactt 
tttgttgttg 
ggctcaatta 
aatttatctg 
catatatatg 
acttctttaa 
gtattgaaaa 
ccctttggtg 
gagtatgttg 
ccatccttga 
gaaggagaag 
accctttctt 
acttcattcc 
cggtgcaata 
ctaaaaattc 
tcttcactca 
ttgcagcacc 
cggtgtgaga 
atatatattt 



aagaattata 2040 
tcaagatgca 2100 
gcagcaatat 2160 
ccgaagtggt 2220 
ttaatctagg 2280 
taagatacxt 2340 
ttcaaaatct 2400 
aaacaagtaa 2460 
gtatgccacc 2520 
gaaggaagaa 2580 
aaatctcgca 2640 
caaaagggaa 2700 
aatcagaaga 2760 
aaatctatgg 2820 
atattgtctc 2880 
atctgccttg 2940 
aagaagtgga 3000 
qgaaacttga 3060 
agcaattccc 3120 
ctaatcttag 3180 
cagaagagat 3240 
atctcaaaga 3300 
aattgtgttg 3360 
cagagttart 3420 
taacaaccct 3480 
agggaatagg 3540 
aa 3592 



<210> 37 
<211> 7349 
<212> ONA 

<213> S. bulbocastanum BAC SPB4 

<220> 

<221> misc_feature 

<222> (D..C7349) . 

<223> /note="Genoimc fragment 



<400> 37 

gatcttttaa 

caaatatata 

acttaaatta 

acttgttaat 

aatgttgtgt 

tataaatatg 

cacaagtaga 

caaggtaata 

tctcctcaaa 

aaaatttaga 

aaaaagtttt 

ataaaaaaga 

aaaaaaaaag 

tttatatata 

atgaagtgaa 

tcctttcata 

gaaaaggaaa 

agttcaagta 

catttagcat 

tattctatat 

aagaaaaata 

cataaatata 

aagggcaaag 

ttgaacaatg 

tttagaggca 

gaaagtcaat 

cagaaaaatt 

atcaatatga 



atattttgaa 
aaarttattt 
ttagactctc 
gttgtaatta 
cacttggaaa 
atctttaact 
ttgacaggct 
gtatattcca 
gtagttaata 
tgaaatttac 
taataatttg 
ataataaaat 
aaagaaatta 
ataaaattaa 
atgacgtggt 
ctaatgataa 
atggggcggt 
atcaaccaac 
aaaaaatttc 
taattaaatt 
tttaaggaca 
actttgtaaa 
atggaaaaga 
aaaaaaatct 
aaaaattagt 
ttgactaatt 
agatattcaa 
taaaaaaata 



ttagcaatta 
tttgaaaaaa 
gaaaaatgaa 
ttggcgaaca 
aaacaccaaa 
taattgcagc 
tgctaattga 
gacaaatttt 
tacttttgag 
tttctagtta 
acttttatgc 
tatatgataa 
gatgacaaca 
atttaaaatt 
ggtgggtcca 
attttttttt 
aattacaaag 
taaactacta 
attgcactta 
ttctatatta 
taatttaact 
tagtgcaatt 
aactaaatat 
caaaaattca 
actccatccg 
tttaaagcta 
aaactataca 
tatcgtaaaa 



ttgtgactat 
agaagatttc 
aagtgtcaca 
ataatgttgt 
tagaactatt 
agacaggcat 
gtgtctgtta 
gtgttaccaa 
tgttgtatca 
aattggtcaa 
tatatttttt 
tttttataat 
atgtccaaaa 
tgatgaaaaa 
tgtgacataa 
tttttttttt 
tagggaatcg 
aaatttttct 
cttttagagt 
attaaatttg 
catatttttc 
tatctcctag 
tcatcttaaa 
attaatatga 
ttcactttga 
aattagatta 
aaaaatatta 
tattagtcaa 



aatacttttt 
atgcgcaaat 
taaattgaca 
tgattatcac 
catgtttttt 
gatctttaac 
taatcagtat 
attaaatata 
tgtttttaat 
agttgaaaga 
taaagttgaa 
acaatggcct 
ataatcttaa 
atagagaaaa 
aaaaaaattc 
ttttactaat 
aactttatca 
aattaatgat 
tttgaaaaca 
tgaggtaata 
actattgttt 
aagcaaattt 
ctttgaacaa 
aatggagaga 
tttgtcatgt 
cactaattca 
tacattgcaa 
aatttttata 



acataatttg 
tccaggtcaa 
caaagggagt 
tttctgaata 
ctttagtata 
tttaaatgtg 
taattactct 
tttctaaaac 
ataaaatgtt 
atttcaagtg 
cgacttrtta 
ttatatgatg 
agaattacga 
gaggaagatg 
tcttaaataa 
tgcgtataga 
acaagttgag 
aattgtaatt 
gtacttcatc 
caaacttatt 
tttgtgaaat 
caccaaagaa 
ttcaattatt 
gtaactttat 
tgcactrttc 
atattttaaa 
ttttttgcat 
atttgactca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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aatcatgaaa agtataataa ttaatagtgg acggaggaag 
ggccattttt ggtccaaggg ccattagcag ttctcttcat 
tagatgggca tcttactaaa aatatttgtc tcatattact 
aagaattaat taattttttc tcattttacc cctacaatta 
aacaaatttt gaagaatcaa aatttctttt gcaagagact 
aaaataataa aagctgtcaa tttattgacc atcacttaat 
ctgatctaat atgagacgga caaaatatat tctaaaatat 
ttctaaccat tcactacact atattatgca ttttatccgc 
aattaattag gaaagaggaa actgccaatg aggaagagta 
gaaaaaaaga taatactcac tcttttcgat ttttattttt 
catgtaaaaa gataattatt tttttcatgc tttatcctta 
gattattttg taaaatattt atatgaataa ttgttttcgt 
caatgataaa taaaaatgaa tgaagagagt agaaaacaaa 
cttgagagat taaaagggtc caaaacgcct tggattttga 
catgaaataa ttgaatttgt attattacaa gtcaaacttt 
atcttggttt caaaattaca cattcattca ttcacagatc 
tccacatatg gctgaagctt tcattcaagt tctgctagac 
aggggaactt gtattgcttt tcggttttca agatgagttc 
ttctacaatt caagccgtcc ttgaagatgc tcaggagaag 
agaaaattgg ttgcaaaaac tcaatgctgc tacatatgaa 
atataaaacc aaggccacaa gattctccca gtctgaatat 
tatccctttc cgtcacaagg tcgggaaaag gatggaccaa 
aattgctgag gaaagaaaga attttcattt gcacgaaaaa 
tagacgggaa acaggtactc atcttaaatt agtattacaa 
tttttggcaa ttatcaaatt cagaaaaggg ttaaatatac 
tgtatatata cctctcgttg tactttcgat ctgaatatac 
agaatcaaat tatccacccc aacttttaaa tactcgatat 
aactcatcca ctacccattc cctttgcttt gaattctttt 
cactcgatcc gttttgcttt tcttaacaaa gcagctcaga 
tctgtttctc tgtgtgctgc acttgggtcc ttaatcccat 
cccaacgacg gtagcctttc ctgacagctg actgtaaatt 
aagattagac atgtttttcc ttgtcattga ttaggctgga 
aggggatata ttggaccaaa agtagaatgg gtatatattt 
ggagtatatt gtgcgaaaat atcctctatt ttctgttgtc 
ataatattct catgtggaca ttgcttgcac caggttctgt 
atggaagaga caaagagaaa gatgagatag tgaaaatcct 
cccaacacct ttcagtcctc ccaatacttg gtatgggggg 
cccaaatggt cttcaatgac cagagagtta ctgagcattt 
gtgtctcgga agattttgat gagaagaggt taataaaggc 
gaaggccact acttggtgag atggacttgg ctccacttca 
tgaatggaaa aagatacttg cttgtcttag atgatgtttg 
gggctaattt aagagcagtc ttgaaggttg gagcaagtgg 
ctcgtcttga aaaggttgga tcaattatgg gaacattgca 
tgtctcaaga agattgttgg ttgttgttca tgcaacgtgc 
taaatccaaa ccttgtggca atcggaaagg agattgtgaa 
tagcagccaa aactcttgga ggtattttgt gcttcaagag 
atgtgagaga cagtccgatt tggaatttgc ctcaagatga 
tgaggcttag ttaccatcaa cttccacttg atttgaaaca 
tgttcccaaa ggatgccaaa atggaaaaag aaaagctaat 
gttttctttt atcaaaagga aacatggagc tagaggatgt 
aattatactt gaggtctttt ttccaagaga ttgaagttaa 
agatgcatga tctcatccat gatttggcaa catctctgtt 
gcaatatccg tgaaataaat aaacacagtt acacacatat 
aagtggtgtt tttttacact cttcccccct tggaaaagtt 
atctaggtga ttcgacattt aataagttac catcttccar 
gatacttgaa cctgtatggc agtggcatgc gtagtcttcc 
aaaatctgca aactcttgat ctacaatatt gcaccaagct 
caagtaaact tggtagtctc cgaaatcttt tacttgatgg 
tgccaccaag gataggatca ttgacatgcc ttaagactct 
ggaagaaagg ttatcaactt ggtgaactag gaaacctaaa 
tctcgcatct t'gagagagtg aagaatgata aggacgcaaa 
aagggaatct gcattcttta agcatgagtt ggaataactt 
cagaagaagt taaagtgctt gaagccctca aaccacactc 
tctatggctt cagaggaatc catctcccag agtggatgaa 
ttgtctctat tctaattagc aacttcagaa actgctcatg 
tgccttgtct agaaagtcta gagttacact gggggtctgc 
aagtggatat tgatgttcat tctggattcc ccacaagaat 
aacttgatat atgggacttt ggtagtctga aaggattgct 



tattgtcttt ccagatttgt 1740 
tttctacttc tgtctcatat 1800 
tgattattta ttaaatcaaa 1860 
atatagtttt aaaagtttta 1920 
tattaatata aacaaaggat 1980 
aatatataaa atacaaactg 2040 
tttcggacag atatgtgata 2100 
caatgactta tttcagcttt 2160 
ggggcgtagt tgctgtcgac 2220 
atttatcact tttaacctat 2280 
gtattaaaca atttaatagg 2340 
aatgaatttg tccggtcaaa 2400 
acaaaagaac aagttgacaa 2460 
gattccatat gtgaaatttc 2520 
ccatttcatt ccaactagcc 2580 
taatattctt aatagtgatt 2640 
aatctcactt ctttcctcaa 2700 
caaaggcttt caagcatgtt 2760 
caactcaaca acaagcctct 2820 
gtcgatgaca tcttggatga 2880 
ggccgttatc atccaaaggt 2940 
gtgatgaaaa aactaaaggc 3000 
attgtagaga gacaagctgt 3060 
caactaagtt tatattcatt 3120 
tcatgtccta tcgtaaatag 3180 
ttgtcaaatc tggcaagctc 3240 
ctttagaaat ccacctgtct 3300 
ctttacctat aaacttggaa 3360 
gaaaagaggt tttcttctat 3420 
taaaaacagg gcatgttaat 3480 
ttgtctaaca aagaaaaaaa 3540 
tttctttcag agtggaacat 3600 
aaagtatttc tgatagaaca 3660 
tcctaatgag tttgaatgta 3720 
attaaccgaa ccgcaggttt 3780 
aataaacaat gttagtgatg 3840 
attaggaaaa acgactcttg 3900 
ccattccaaa atatggatrt 3960 
aattgtagaa tctattgaag 4020 
aaagaagctt caggagttgc 4080 
gaatgaagat caacagaagt 4140 
tgcttctgtt ctaaccacta 4200 
accatatgaa ctgtcaaatc 4260 
atttggacac caagaagaaa 4320 
aaaaagtggt ggtgtgcctc 4380 
agaagaaaga gcatgggaac 4440 
aagttctatt ctgcctgccc 4500 
atgctttgcg tattgtgcgg 4560 
ctctctctgg atggcgcatg 4620 
gggcgatgaa gtatggaaag 4680 
agatggtaaa acttatttca 4740 
ttcagcaaac acatcaagca 4800 
gatgtccatt ggtttcgccg 4860 
tatctcgtta agagtgctta 4920 
tggagatcta gtacatttaa 4980 
aaagcagtta tgcaagcttc 5040 
ttgttgtttg ccaaaagaaa 5100 
tagccagtca ttgacttgta 5160 
aggtcaattt gttgttggaa 5220 
tctctatggc tcaattaaaa 5280 
agaagccaat ttatctgcaa 5340 
tggaccacat atatatgaat 5400 
caatctgact tctttaaaaa 5460 
tcactcagta ttgaaaaata 5520 
cttaccaccc tttggtgatc 5580 
ggatgtggag tatgttgaag 5640 
aaggtttcca tccttgagga 5700 
gaaaaaggaa ggagaagagc 5760 
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aattccctgx gcxtgaagag atgataattc acgagtgccc "ttctgacc ctttcttcu 5820 
atcttagggc tcttacxtcc ctcagaaxtt ggataataa |g«g * ataatc 5940 

aagagatgtt caaaaacctt fcaaatctca aatacxxgac aa «| aaaattcaat 6000 

xciaagagct gcctaccagc ^^ggctagtc *9aatgctxx ga y tcactcac ag 6060 
tgtgttgcgc actagagagt "ccctgagg !|999«gga *9| cagcacctaa 6120 

agttatttgt tgaacactgt aacatgctaa aatgxxxacc ay yyy * 618Q 
caaccctcac aagtttaaaa axtcggggat 9«cacaact gatcaag yy v^^l 624 q 
gaataggaga agactggcac aaaatttctc acattcctaa tgxgaa tgattgcat g 6300 
ttatttgcta ttgtttcttt 9«tgtgagt "ttttggtt cctgccax^g « att » tctt 6360 
taattttttt ctagggttgt "gtttgttg agtctctctc xca ss acttatttgt 6420 
ttggtaacaa attaacaatc ^atttgtatt "acgctttc ag tgattttatt 6480 

aattgtttct ttgtttgtaa attgtgagta ^ttattgta ^g tttatgtgac 6540 

ttgaaaacaa atcaaxaaga tccatctgta "atactccc xxcg aaaca ttttg 6600 

actxttxgga tttcgagatt "ttgatctt aaatttttca tag taaaattaat 6660 

aattatcaat tattgagatt ttagxatttt "axgxag^ » t tact agactc 6720 

ittttaaaaa aaagaagaxt tcatgcgcat attcccgatc aaactxaa^ » taat g780 

tcgaaaaatg aaaagtgtca cataaattga facagaggga gx t y gtC acgtgga 6840 
tattggcgaa caataatgtt 99tgattatc actttctgaa » y * tttttaact t 6900 
aaaaacacca aatagaagta "catgcttt «tagtatat at y acttgtataa 6960 

ggtttcagcg gatagtcatg acctttaact ftgaatgtgc acaay « tacC aattgc 7020 
alttaaacaa attttataaa "tatacaat atgacactga gagtaaxxg^ gg ^go 

agtcgtxgct gctttxcgat tctct f^3t ^tctaggxa a^y acattttggt 7140 

ccaaaaatax Ecctgaagta ccagaaaagg tctcaaaata "aacca ccacattttt 7200 
ctaaaaatat ccttctactc atcctttttt 9tctaaaatt accctttc ^ 7250 

taxxxxcxxa SgSSSS xaaaaaxfca SSSSS xtttttccca taattxaxcc 7320 
aaatcaaaac aatatatttx ttcaagatc 



<210> 38 

<211> 3260 

<213> solarium bulbocastanum 
<220> 

<221> misc_feature 

<222> CD - (3260) . 

<223> /note= RGCl-blb 



<400> 38 ,-ftrtnrta aataatctca cttttttcat ccaaggggaa 60 

atggctgaag ctttccttca agttctgcta gataaxcxca u gttttcaatg 120 

ctxggattgg tttttggttt cgagaaggag "taaaaaac «tcaagx 9 aa » ^0 

atccaagctg tgctagaaga ^"caagag aagcaacxga ay tgactgtaaa 240 

tggttacaga aactcaatgt tgctgcatat ^agttgatg a ^y* gaccatcact 300 
actgaggcag caagattcaa gcaggcxgxa ^|yua a aaaaa ctaga tgcaattgca 360 
ttctgttaca aggtgggaaa aagaatgaaa gaaaxgatgg aaaaa y t | cta gacgg 420 
gaggaacgga ggaattttca tttagatgaa aggattatag agaga y t ? gca gagaa 480 
Haacaggxg ctcatcttaa "ttatttta aaacaaataa |tattac t | tt « t ? aa 540 
acgaaggaat ttatattcat "ttattttt 99caattatc a 9 t * a * tcagcat 600 
faxxxx^ca KSSK KSSS |cjactgt|c aaaxatctxc tctattxxct 6 0 
xga^caaa ffifSSS ™ ggggg 9 a|g f a -txgaxaa 780 

g a a 33?c SSSS ft ™ gjSSS IS 

taaagatatg ggtttgtgtc tcagatgatt "gatgagaa 9a99^9 cagaaaaagc 1020 
tagaltctat tgaaggaaag tcactgggtg acatggactt |gctcccctc c 9 ^ ^80 
xtcaggagtt gttgaatgga aaaagatact "^"9xxxl «y uy « ggtgcttcaa 1140 
atcalgalaa Itgggataat cttagagcag tattgaagat t99agctagt g| a 9 tatatc ^ 
ttctaattac tactcgtctt gaaaaaattg gatcaattax ggga ay c 126 q 

agttaxcaaa tttgtcxcaa 9fagattgtt |9«9ttgtt caagc a g » agaaat « tg 1320 
accaaaccga aacaagtcct aaacttatgg aaatcggaaa gg g y & y a 3 
ggggtgtgcc tctagcagcc aaaactcttg 9aggccxx^ y » g I aaa ttctg 1440 

gtgaatggga acatgtgaga gatagtgaga tttggaattx ««c y » 1500 
ttttgcctgc cctgaggctg agttatcatc atcttccact ^«^9 a 9* atcg 2tctct 1560 
catattgcgc agtattccca "ggacacca aaatagaaaa ggaaxaxc y 162Q 

g aa^ g a?g^aa SJSSSSJ xS5SS«? «x«caf?a 5.«S.28« aaixctggxa 1680 
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aaacttattt caagatgcat gatctcatcc atgatttggc tacatctatg ttttcagcaa 1740 
gcgcatcaag cagaagtata cgccaaataa atgtaaaaga tgatgaagat atgatgttca 1800 
ttgtaacaaa ttataaagat atgatgtcca ttggtttctc cgaagtggtg tcttcttact 1860 
ctccttcgct ctttaaaagg tttgtctcgt taagggtgct taatctaagt aactcagaat 1920 
ttgaacagtt accgtcttcc gttggagatc tagtacattt aagatacctt gacctgtctg 1980 
gtaataaaat ttgtagtctt ccaaagaggt tgtgcaagct tcaaaatctg cagactcttg 2040 
atctatataa ttgccagtca ctttcttgtt tgccgaaaca aacaagtaag ctttgtaqtc 2100 
tccggaatct tgtacttgat cactgtccat tgacttctat gccaccaaga ataggattgt 2160 
tgacatgcct taagacacta ggttactttg ttgtaggcga gaggaaaggt tatcaacttg 2220 
gtgaactacg aaatttaaac ctccgtggtg caatttcaat cacacatctt gagagagtga 2280 
aaaatgatat ggaggcaaaa gaagccaatt tatctgcaaa agcaaatcta cactctttaa 2340 
gcatgagttg ggatagacca aacagatatg aatccgaaga agttaaagtg cttgaagccc 2400 
tcaaaccaca tcccaatctg aaatatttag aaatcattga cttctgtgga ttctgtctcc 2460 
ctgactggat gaatcactca gttttgaaaa atgttgtctc tattctaatt agcggttgtg 2520 
aaaactgctc gtgcttacca ccctttggtg agctgccttg tctagaaagt ctggagttac 2580 
aagacgggtc tgtggaggtg gagtatgttg aagattctgg attcctgaca agaagaagat 2640 
ttccatccct gagaaaactt catataggtg gcttttgtaa tctgaaagga ttgcagagaa 2700 
tgaaaggagc agagcaattc cccgtgcttg aagagatgaa gatttcggat tgccctatgt 2760 
ttgtttttcc gaccctttct tctgtcaaga aattagaaat ttggggggag gcagatgcag 2820 
gaggtttgag ctccatatct aatctcagca ctcttacatc cctcaagatt ttcagtaacc 2880 
acacagtgac ttcactactg gaagagatgt tcaaaaacct tgaaaatctc atatacttga 2940 
gtgtctcttt cttggagaat ctcaaagagc tgcctaccag cctggctagt ctcaacaatt 3000 
tgaagtgtct ggatattcgt tattgttacg cactagagag tctccccgag gaagggctgg 3060 
aaggtttatc ttcactcaca gagttatttg ttgaacactg taacatgcta aaatgtttac 3120 
cagagggatt gcagcaccta acaaccctca caagtttaaa aattcgggga tgtccacaac 3180 
tgatcaagcg gtgtgagaag ggaataggag aagactggca caaaatttct cacattccta 3240 
atgtgaatat atatatttaa 3260 



<210> 39 
<211> 3971 
<212> DNA 

<213> Solanum bul bocastanum 

<220> 

<221> mi sc_feature 
<222> (1). . (3971) 
<223> /note= M RGC3-blb" 

<400> 39 

atggctgaag ctttcattca agttgtgcta gacaatctca cttctttcct caaaggggaa 60 
cttgtattgc ttttcggttt tcaagatgag ttccaaaggc tttcaagcat gttttctaca 120 
atccaagccg tccttgaaga tgctcaagag aagcaactca acgacaagcc tctagaaaat 180 
tggttgcaaa aactcaatgc tgctacatat gaagtcgatg acatcttgga tgaatataaa 240 
actaaggcca caagattctt gcagtctgaa tatggccgtt atcatccaaa ggttatccct 300 
ttccgtcaca aggttgggaa aaggatggac caagtgatga aaaaactgaa tgcaattgct 360 
gaggaacgaa agaattttca tttgcaagaa aagattatag agagacaagc tgctacacgg 420 
gaaacaggta ctcatcttaa attagtatta caacttagtt tatattcatt tgttttgggc 480 
aatgatcaaa ttatgtaaag gtcaaatata ctcatgtact actgaaaata gtttaaatat 540 
acctctagtt atactattag tacgaacata ctcctcccat atactttgga acaaatattc 600 
ccttaacgaa ataagacacg tgaaaagttc agattcaaat tatccaccct caattttaag 660 
atctgatttc tttaggaaac cactcatctc ctccgttttg agttcttaac gaagcagctc 720 
agagaaaaga ggttttcttc tgttctgttt ctgctgcatt tgtgtcttaa tccaataaca 780 
aacaatacaa attaatatta tgttcacgat gagggtagtc tttctagcta gacatgaact 840 
gagtgtaaat tttgttttaa ggaagaaaaa gaaatgatta ggctggattt ctttcagaqt 900 
ggaatatagg gggataaagt tggagcatag agttccatcg tttatttctt tccttaaagt 960 
aacaagttca acaaaatgat atcaaggtac ggtaatggaa aattattaga cacgtctaaa 1020 
ctacaaaaat ggaatagaaa cttaaattat cagtgacaat atcatccttt aataaagcta 1080 
ccaaatttaa atcatgatac agagaagaaa ccaaaaaaat taggggtgaa ttatttgatt 1140 
ctatgcttat cacatgtctt cccatcaaca tcaaaggaaa aattgtgcca aagtataaac 1200 
ggtgcggtat atttggattg aaagtaaaac aggaggatac atttggacta aaagtataac 1260 
aataagtata tttgatcatt ttatgtatca aattcatgtg gtttttgggg agaagggaag 1320 
tttcaatgtt ttcaatctgc tcctcatctc atccatatct ctttattgtg caaaaccctt 1380 
ctctatttaa ctattttctg ccgactccta atgagcttga atgtaacaat attctcatct 1440 
ggacattgct tgcaccaggt tctgtgttaa ctgaaccaca agtttatgga agggacaaag 1500 
aaaaagatga gatagtgaaa atcctaataa acaatgttag tgatgcccaa aaactctcag 1560 
tcctcccaat acttggtatg gggggactag gaaagacaac tctttcccaa atggtcttca 1620 
atgatcagag agtaactgag cgtttctatc ccaaaatatg gatttgcgtc tcggatgatt 1680 
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ttgatgagaa gaggt tgata aaggca f ag t.ga«tc«t tgaag^aag tccctcagxg 1740 

SS8?£ 8SSS5 il l sd& 88888 SSKS3 

tcttgaaggt tggagcaagt Mtgcatttg ^tctaacra tctgt » tcca gaggattgtt 1980 
gatcaattat gggaacattg caaccatatg „ aataaatcca aaccttgtgg 2040 

ggtttttgtt catgcagcgt gcatttggac a «aagaaga aa ™ aa aagactcttg 2100 

SS38S SSSSS5S a 4S||lI||| « ?; || 

aaatggcaaa ggaaaatctt "cgctttrt 9|atggcaca *99 ttgaggtctt 2400 

gaaatttgga gctagaggat 9taggtaatg aa 9^99 c | agat gcat gacctcatcc 2460 
?cttccaaga gattgaagtt 9 aa «tggta ^acttattt caa? g^ « gtgaaa taa 2520 
atgatttggc tacatctctg «"«gcaa -"caw-a tgaagtggta tcttcttact 2580 
atgctaatta tgatggatat atfatgtcga ttggttttgc zg g hh aactc aacc 2640 
ctccttcact cttgcaaaag tttgtctcat ^aagggtg--t aaaatacctg gacttgtctg 2700 
taaatcaatt accatcttcc attggagatc tagtacattt aagatacctg ■ » act » 2760 
gcaattttag aattcgtaat cttccaaaga 9attatgcag f caaacaagt aaac t tgg ta 2820 
ttgatctaca ttattgcgac "tctttc^x 9_«^" tc aaC gccacca aggataggat 2880 
gtctccgaaa tcttttactt gatggctgtt "ttgacg^c « » ggttatcaac 2940 

tgttgacatg ccttaagtct ctaagttgct "gttattgg Ettgacagag 3000 

ttggtgaact aaaaaaccta aatctctatg fctcaatxxc ctgcactctt 3060 

iqaagiaaga tagcgatgca aaagaagcta atttatctgc ^aaagcaaax a 3120 
tatgcctgag ttgggacctt gatggaaaac atagatatga ^«|aagxx « « tcc 318 q 
tcaaaccaca ctccaatctg aaatatttag aaatcaatgg 5 a itaaaatt agaggttgtg 3240 
cagattggat gaatcaatca gttttgaaaa ftgttgtctc "ttagaatt a » 3300 

aaiactactc atgcttacca cccxttggtg agctgccttg ^tagaaagr " y y 
acaccgl^c aglagatgtg «g»tgtq | a ||^ a la SS«5§?S afaaaggaag 3420 
ccttqaggaa acttgttata tgggacttta %l**Yllt$t rtaataccct atgtttqtta 3480 
gagalaagca attccctgtg cttgaagaga tgacatttta Jggfg" acagttttga 3540 
ttccgaccct ttcttctgtc aagacattga ^agttattgc g"|*^ ac gtagaagcta 3600 

ffi&SS S2S5S | SI « SKWg Uj*« us 

SK85 JSSSK S3S5S i f| 3 sfeSsfe 138 

"""til? c 9 a 9 tt9tct 9 tca 9 t;;ct SXIgSJ S!?«al 9 c 3900 

SS1SS"5IS SSSSSS SKSS- 3 |f o 



tatatgagtg 



<210> 40 
<211> 3899 

<212> DNA 

<213> solatium bulbocastanum 
<220> 

<221> misc_feature 
<222> CD..C3899) 
<223> /r.ote='RGC4-blb 



linisiiiiill'iiil 
liiliiiiliillSl 

JSSSSR SKSS lllHHH! "fel 28838 SSSER | 

dssai sili H i i sites sass S 
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actgtctaaa 
tatctatcat 
ttgtgagtta 
ttttttaaat 
tgcaaggctt 
atctgccatt 
agctcaattg 
aaccaaaagt 
atgttaatgt 
agacgacact 
aaatatgggt 
gaaatattga 
aggagttatt 
tagaaaagtg 
tagctactac 
tgtcaaattt 
aaaaagaagc 
gtgtgccttt 
aatgggaaca 
tgcctgctct 
attgtgcagt 
tggcgcatgg 
tatggaatga 
cttatttcaa 
catcatgcgg 
gtttcgctgc 
gggtgcttaa 
tacatttaag 
gcaagcttca 
caaaacaaac 
cttctactcc 
tgggaagcaa 
tttcaatcac 
caaaagcaaa 
aatccaaaga 
agatcattgc 
aggtcatctc 
agcttccttg 
aagaggatga 
ttcgtatatg 
tccccatgct 
cttctgtcaa 
ctaatcttag 
cagaagagat 
atctcaaaga 
aaagttgtga 
cacagttgtt 
taacagccct 
aggaaatagg 



atggaaaaag 
ataaaaaaaa 
aaacagttct 
aaaaaattag 
ggcgggtcga 
cctgtctaaa 
taacaaattc 
ctacggaagg 
tgccgaagaa 
tgcccaaatg 
ttgtgtctca 
aagaagttct 
gaatggaaaa 
ggctaagtta 
tcgtcttgaa 
gtctccacat 
aaatcctaat 
agcagccaag 
tgtgagagat 
aagactgagt 
attcccaaag 
ttttctttta 
attatacttg 
gatacatgat 
caatatccgc 
agtggtgtct 
tctaagttac 
atacctggac 
aaatcttcag 
aagtaaactt 
accaaggata 
gaaaggttat 
acaccttgag 
tctgcaatct 
agttaaagtg 
cttcggagga 
tgttagaatt 
tctagaaaat 
tgtccattct 
gttctttcgc 
tgaagagatg 
gaaattagaa 
cactcttact 
grtcacaagt 
tctgcctacc 
ctcactagag 
tgttaaatac 
cacaaattta 
agaagactgg 



caaagaaata 
gcattgatta 
gttgggaagg 
caaattatgc 
aatctttgca 
tgtttgtttt 
tttgtgtcca 
gacaaagagg 
cttccagtct 
atcttcaacg 
gatgattttg 
cctcatgttg 
cgatacttgc 
agagcagtct 
aaggttggat 
gatagtttac 
ctagtggcca 
actcttggtg 
aatgagattt 
tatcatcacc 
gacaccaaaa 
tcaaagggaa 
aggtctttct 
ctaatccatg 
gaaataaatg 
tcttactctc 
tcaaaacttg 
ctgtcttgca 
actcttgatg 
agtagtctcc 
ggattgttga 
caacttggtg 
agagtgaaga 
ttaagcatga 
cttgaagcac 
ttccgttttc 
aaaagctgca 
ctagagttac 
agattctcca 
agtttgaaag 
gcgattttat 
gttcacggca 
tccctccgca 
cttacaaatc 
agcctgacta 
agtttccctg 
tgtaagatgc 
ggagtttctg 
cacaaaattg 



agaagtatac 
ctaggatatg 
atttagatac 
gggctggggc 
agtttgtgtg 
gtctataatt 
cattacttgg 
aggatgagat 
tccctataat 
atgagagagt 
atgagaagag 
aggacttggc 
ttgtcttaga 
taactgttgg 
caattatggg 
ttttgtttat 
ttggaaagga 
gtcttttacg 
ggagtctgcc 
ttccacttga 
tgataaagga 
acttggagct 
tccaagaaat 
atttggctac 
tcaaagatta 
cttcgctctt 
agcaattacc 
ataacttccg 
tacataattg 
gacatcttgt 
catgccttaa 
aactgaaaaa 
acgatacgga 
gttgggataa 
tcaaaccaca 
caagctggat 
aaaactgctt 
aaaacggatc 
caagaagaag 
ggctgatgaa 
attgccctct 
acacaaacac 
ttggtgctaa 
tcgaattctt 
gtctcaatgc 
aacaagggct 
taaaatgttt 
gttgtccaga 
ctcacattcc 



agaggctgac 
ggttctttta 
acgtggatag 
gggttgaaaa 
ggtttgccct 
cttgctgact 
aacaggtttt 
agtgaaaatt 
tggtatgggg 
aactaagcat 
gttaattaag 
ttcatttcag 
tgatgtttgg 
agcaagaggt 
aacgttgcaa 
gcaacgcgca 
gattgtgaag 
cttcaagaga 
tcaagatgaa 
tttgagacaa 
aaatctcatt 
agaggatgtg 
tgaagctaaa 
atctctgttt 
taagcataca 
gaaaaagttt 
gtcttccatt 
tagtcttcca 
ctactcactt 
tgttgatggc 
gactctaggt 
cctaaatctc 
tgcagaagcc 
cgatggacca 
ccccaatctg 
aaatcactca 
gtgcttacca 
tgcggaggtg 
ctttccatcc 
agaggaagga 
gtttgttttt 
tagaggtttg 
ctacagagcg 
gagtttcttt 
tttgaagcgt 
agaaggttta 
acccgaggga 
agtggaaaag 
aaatctggat 



ttcttcatag 1020 
aattacaaat 1080 
tatctagaag 1140 
cagcaaactt 1200 
gcaccaccca 1260 
cattctaatg 1320 
gtgttaactg 1380 
ctgataaaca 1440 
ggactaggaa 1500 
ttcaatccca 1560 
acaattatag 1620 
aagaagctcc 1680 
aatgatgatc 1740 
gcttctattc 1800 
ccatatcatt 1860 
tttgggcaac 1920 
aaatgtggtg 1980 
gaagagagtg 2040 
agttctattt 2100 
tgctttgcgt 2160 
actctctgga 2220 
ggtaatgaag 2280 
tcgggtaata 2 340 
tcggcaagcg 2400 
gtgtccattg 2460 
gtctcgttaa 2520 
ggagatctat 2580 
gagaggttgt 2640 
aattgtttgc 2700 
tgtccattga 2760 
ttctttattg 2820 
tgcggctcaa 2880 
aatttatctg 2940 
aacagatatg 3000 
aaatatttag 3060 
gttttggaga 3120 
ccctttgggg 3180 
gagtatgttg 3240 
ctgaaaaaac 3300 
gaagagaaat 3360 
ccaacccttt 3420 
agctccatat 3480 
acttcactcc 3540 
gacttcaaga 3600 
ctccaaattg 3660 
acttcactca 3720 
ttgcagcacc 3780 
cgctgtgata 3840 
attcattag 3899 



<210> 41 
<211> 970 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: deduced 
Rpi-blb protein sequence domain A t b and C 

<220> 

<221> SITE 
<222> (1). .(970) 

<400> 41 

Met Ala Glu Ala Phe He Gin val Leu Leu Asp Asn Leu Thr Ser Phe 
15 10 15 
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L eu L ys Gly Glu Leu val ueu ueu Phe Gly Phe Gin Asp Glu Phe Gin 

Arg L eu ser Ser Met Phe ser Thr lie Gin Ala Val Leu Glu Asp Ala 

Gin Glu Lys cnn Leu ash Asn Lys Pro Leu Glu Asn Trp Leu Gin Lys 

L eu aI! Ala Ala Thr Tyr Glu val Asp Asp lie Leu Asp Glu Tyr Lys 

Thr Lys Ala Thr Arg Phe Ser Gin Ser Glu Tyr Gly Arg Tyr His Pro 

Lys val He Pro Phe Arg His Lys val Gly Lys Arg Met Asg Gin val 

Met Lys Lys Zl Lys Ala He Ala Glu Glu Arg Lys Agn Phe His Leu 

115 1ZU 
His Glu Lys He val Glu Arg Gin Ala val Arg Arg du Thr Gly Ser 



130 



val ueu Thr Glu Pro Gin val Tyr Gly Arg asp Lys Glu Lys Asp Glu 
14S 150 " . 

Xle val Lys He Leu He As. Asn val Ser Asp Ala Gin H,s ueu Ser 

val ueu Pro lie ueu Gly Met Gly ^ ueu Gly uys Thr Thr ueu Ala 

Gin Met val Phi Asn asp Gin Arg val Thr Glu His Phe His ser uys 

195 ^ uu 
lie Trp He cys val ser Glu as P Phe Asp Glu uys Arg ueu He uys 



Ala xle val Glu ser g. Glu Gly Arg Pro ueu ueu Gly Glu Met Asp 



225 



L eu Ala Pro ueu Gin uys Uys ueu Gin Glu ueu ueu Asn Gly Uys Arg 



245 



Tyr ueu ueu val ueu Asp Asp val Trp Asn Glu asp Gin Gin uys Trp 



2 60 



Al. Asn ueu Arg Ala val ueu ^ val Gly Ala Ser Gly Ala ser val 

275 

L eu Thr Thr Thr Arg ueu Glu uys val Gly ser lie Met Gly Thr ueu 

290 l ^ 
Gln Pro Tyr Glu ueu ser Asn ueu ser Gin Glu Asp cys Trp ueu ueu 
305 310 

Phe Met Gin Arg Ala Phe Gly His Gin Glu Glu lie Asn Pro Asn Ueu 

val Ala xle cly uys Glu xle val uys uys ser sly Gly val Pro ueu 

Ala Ala uys Thr ueu Gly Gly xle ueu cys Phe uys Arc Glu Glu Arg 
355 360 

i K „ Ae „ Cor prc » Tie Trp Asn Leu Pro Gin Asp 
Ala Trp Glu His val Arg Asp Ser Pro lie w 

370 37b 
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10 



20 



25 



30 



35 



45 



50 



55 



Glu Ser Ser lie Leu Pro Ala Leu Arg Leu Ser Tyr His Gin Leu Pro 
385 390 395 400 

Leu Asp Leu Lys Gin cys Phe Ala Tyr cys Ala val Phe Pro Lys Asp 
405 410 415 

Ala Lys wet Glu Lys Glu Lys Leu lie ser Leu Trp Met Ala His Gly 
420 425 430 

Phe Leu Leu ser Lys Gly Asn Met Glu Leu Glu Asp Val Gly Asp Glu 
435 440 445 

val Trp Lys Glu Leu Tyr Leu Arg Ser Phe Phe Gin Glu lie Glu val 
450 455 460 

Lys Asp Gly Lys Thr Tyr Phe Lys Met His Asp Leu lie His Asp Leu 
4 65 470 475 480 

Ala Thr Ser Leu Phe Ser Ala Asn Thr Ser Ser ser Asn He Arq Glu 
485 490 495 

Ile Asn Lys His Ser Tyr Thr His Met Met Ser He Gly Phe Ala Glu 
500 505 510 

val val Phe Phe Tyr Thr Leu Pro pro Leu Glu Lys Phe Ile Ser Leu 
515 520 525 

Arg val Leu Asn Leu Gly Asp Ser Thr Phe Asn Lys Leu Pro ser ser 
530 535 540 

lie Gly Asp Leu val His Leu Arg Tyr Leu Asn Leu Tyr Gly Ser Gly 
545 550 555 560 

Met Arg Ser Leu Pro Lys Gin Leu Cys Lys Leu Gin Asn Leu Gin Thr 
565 570 575 

Leu Asp Leu Gin Tyr Cys Thr Lys Leu Cys cys Leu pro Lys Glu Thr 
580 585 590 

Ser Lys Leu Gly Ser Leu Arg Asn Leu Leu Leu Asp Gly Ser Gin Ser 
595 600 605 

Leu Thr Cys Met Pro Pro Arg Ile Gly Ser Leu Thr Cys Leu Lys Thr 
610 615 620 

Leu Gly Gin Phe val val Gly Arg Lys Lys Gly Tyr Gin Leu Gly Glu 
625 630 635 640 

Leu Gly Asn Leu Asn Leu Tyr Gly ser ile Lys lie Ser His Leu Glu 
645 650 655 

Arg val Lys Asn Asp Lys Asp Ala Lys Glu Ala Asn Leu Ser Ala Lys 
660 665 670 

Gly Asn Leu His ser Leu ser Met ser Trp Asn Asn Phe Gly Pro His 
675 680 685 

Ile Tyr Glu Ser Glu Glu val Lys Val Leu Glu Ala Leu Lys Pro His 
690 695 700 

ser Asn Leu Thr ser Leu Lys Ile Tyr Gly Phe Arg Gly ile His Leu 
705 710 715 720 

Pro Glu Trp Met Asn His Ser Val Leu Lys Asn lie val Ser He Leu 
725 730 735 



Ile Ser Asn Phe Arg Asn cys Ser Cys Leu Pro Pro Phe Gl 



y Asp Leu 
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740 745 750 

Pro Cys Leu Glu ser Leu Glu Leu His Trp Gly sen Ala Asp val Glu 
75 5 /bU 



Tyr v.l Glu Glu val asp lie Asp v.l His Ser Gly Phe Pro Thr Arg 

770 775 
He Arg Phe Pro Ser Leu Arg Lys Leu Asp lie Trp Asp Phe Gly |er 



785 ' 790 



Leu Lys Gly Leu Leu Lys Lys Glu Gly Glu Glu Gin Phe Pro val Leu 

80 5 

lie 
820 



Glu Glu Met lie He His Glu Cys Pro Phe Leu Thr Leu ser ser Asn 



Leu Arg ga Leu Thr ser Leu Arg He cys Tyr ash Lys val Ala Thr 

ser Phe III Glu Glu Met Phe Lys Asn Leu Ala ash Leu Lys Tyr Leu 

850 855 
Thr He ser Arg Cys ash Asn Leu Lys Glu Leu Pro Thr ser Leu Ala 
865 870 

ser L eu Asn Ala Lys ser Leu Lys lie Gin Leu cys Cys Ala Leu 
Glu ser Leu Pro Glu Glu Gly Leu Glu Gly Leu Ser ser Leu Thr Glu 
Le u Phe val Glu His Cys Asn Met Leu Lys cys Leu Pro Glu Gly Leu 



915 



G ln His Leu Thr Thr Leu Thr Ser Leu Lys lie Arg Gly Cys Pro Gin 
930 

L eu lie Lys Arg Cys Glu L ys Gly lie Gly Glu Asp Trp His Lys lie 

945 9 - >0 

ser His lie Pro A|n val Asn He Tyr lie 

<210> 42 
<211> 979 
<212> PRT 

<213> Artificial sequence 

<223> Description of Artificial sequence: alignment 
RGC3-blb 

<220> 

<221> SITE 
<222> CD..C979) 

5S° aiI'gIu Ala Phe lie Gin val val Leu Asp Asn Leu Thr Ser Phe 



Lei Lys Gly Glu Leu Val Leu Leu Phe Gly Phe Gin Asp Glu Phe Gin 
Arg L eu ser ser Met Phe Ser Thr He Gin Ala val Leu Glu Asp Ala 
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Gin Glu Lys Gin Leu Asn Asp Lys Pro Leu Glu Asn Trp Leu Gin Lys 
50 5S 60 

Leu Asn Ala Ala Thr Tyr Glu Val Asp Asp lie Leu Asp Glu Tyr Lys 
65 70 75 £o 

Thr Lys Ala Thr Arg phe Leu Gin Ser Glu Tyr Gly Arg Tyr His Pro 
85 90 95 

Lys val lie Pro Phe Arg His Lys val Gly Lys Arg Met Asp Gin val 
100 105 110 

Met Lys Lys Leu Asn Ala lie Ala Glu Glu Arg Lys Asn Phe His Leu 
115 120 125 

Gin Glu Lys lie lie Glu Arg Gin Ala Ala Thr Arg Glu Thr Gly Ser 
130 13 5 140 

val Leu Thr Glu Pro Gin val Tyr Gly Arg Asp Lys Glu Lys Asp Glu 
145 150 155 160 

lie val Lys lie Leu lie Asn Asn val ser Asp Ala Gin Lys Leu ser 
165 170 175 

val Leu Pro lie Leu Gly Met Gly Gly Leu Gly Lys Thr Thr Leu ser 
180 185 190 

Gin Met val Phe Asn Asp Gin Arg Val Thr Glu Arg Phe Tyr Pro Lys 
195 200 205 

He Trp lie Cys val ser Asp Asp Phe Asp Glu Lys Arg Leu He Lys 
210 215 220 

Ala lie val Glu Ser He Glu Gly Lys Ser Leu Ser Asp Met Asp Leu 
225 230 235 240 

Ala Pro Leu Gin Lys Lys Leu Gin Glu Leu Leu Asn Gly Lys Arg Tyr 
245 250 255 

Phe Leu val Leu Asp Asp val Trp Asn Glu Asp Gin His Lys Trp Ala 
260 265 270 

Asn Leu Arg Ala val Leu Lys val Gly Ala ser Gly Ala Phe val Leu 
275 280 285 

Thr Thr Thr Arg Leu Glu Lys val Gly Ser He Met Gly Thr Leu Gin 
290 295 300 

Pro Tyr Glu Leu Ser Asn Leu ser Pro Glu Asp Cys Trp Phe Leu Phe 
305 310 315 320 

Met Gin Arg Ala Phe Gly His Gin Glu Glu lie Asn Pro Asn Leu Val 
325 330 335 

Ala lie Gly Lys Glu lie val Lys Lys Cys Gly Gly val Pro Leu Ala 
340 345 350 

Ala Lys Thr Leu Gly Gly He Leu Arg Phe Lys Arg Glu Glu Arg Glu 
355 360 365 

Trp Glu His Val Arg Asp Ser Pro He Trp Asn Leu Pro Gin asp Glu 
370 375 380 

Ser Ser He Leu Pro Ala Leu Arg Leu ser Tyr His His Leu Pro Leu 
385 390 395 400 

Asp Leu Asp Gin Cys Phe val Tyr Cys Ala val Phe Pro Lys asd Thr 
405 410 415 
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Lys Met Ala Lys Glu Asn Leu He Ala Phe Trp Met Ala His Gly Phe 
' 420 425 4 3U 

Leu Leu sen Lys Gly Asn Leu Glu Leu Glu Asp val Gly Asn Glu val 
435 440 44 -> 

Trp Asn Glu Leu Tyr Leu Arq Ser Phe Phe Gin Glu He Glu val Glu 
4S0 455 460 

Ser Gly Lys Thr Tyr Phe Lys Met His Asp Leu He His Asp Leu Ala 
465 470 47 -> 

Thr ser Leu Phe Ser Ala Asn Thr ser ser Ser Asn He Arg Glu He 
485 490 

Asn Ala Asn Tyr Asp Gly Tyr Met Met Ser lie Gly Phe Ala Glu val 
500 505 => 1U 

val ser Ser Tyr Ser Pro ser Leu Leu Gin Lys Phe val ser Leu Arg 
515 520 

val Leu Asn Leu Arg Asn Ser Asn Leu Asn Gin Leu Pro Ser ser He 
530 535 540 

Gly as P Leu val His Leu Arg Tyr Leu Asp Leu Ser Gly Asn Phe Arq 
545 K 550 555 5bO 

He Arg Asn Leu Pro Lys Arg Leu cys Lys Leu Gin Asn Leu Gin Thr 

Leu Asp Leu His Tyr Cys Asp ser Leu ser Cys Leu Pro Lys Gin Thr 
580 585 ^ u 

ser Lys Leu Gly ser Leu Arg Asn Leu Leu Leu Asp Gly Cys Ser Leu 
600 605 



595 



Thr Ser Thr pro Pro Arg lie Gly Leu Leu Thr Cys Leu Lys Ser Leu 
610 615 620 

Ser Cys Phe val He Gly Lys Arg Lys Gly Tyr Gin Leu Gly Glu Leu 

62 5 ^30 o3-> 

L ys Asn Leu Asn Leu Tyr Gly Ser He Ser He Thr Lys Leu Asp Arg 

7 645 650 o** 

val Lys Lys Asp Ser Asp Ala Lys Glu Ala Asn Leu Ser Ala Lys Ala 
660 665 670 

Asn Leu His ser Leu Cys Leu ser Trp Asp Leu Asp Gly Lys His Arg 
675 6 80 oB^ 

Tvr asp ser Glu val Leu Glu Ala Leu Lys Pro His Ser Asn Leu Lys 
690 695 700 

Tyr Leu Glu He Asn Gly Phe Gly Gly He Arg Leu Pro Asp Trp Met 
705 710 715 720 

Asn Gin ser val Leu Lys Asn val Val Ser He Arg He Arg Gly Cys 
72 5 730 

Glu Asn cys Ser Cys Leu Pro Pro Phe Gly Glu Leu Pro Cys Leu Glu 
740 745 750 

ser Leu Glu Leu His Thr Gly Ser Ala Asp val Glu Tyr val Glu Asp 
755 760 765 

Asn val His Pro Gly Arg Phe pro ser Leu Arg Lys Leu val lie Trp 
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770 775 780 

Asp Phe ser Asn Leu Lys Gly Leu Leu Lys Lys Glu Gly Glu Glu Gin 
785 790 795 800 

Phe Pro val Leu Glu Glu Met Thr Phe Tyr Trp Cys pro Met Phe Val 
805 810 815 

He Pro Thr Leu ser ser val Lys Thr Leu Lys val lie Ala Thr asp 
S20 825 830 

Ala Thr val Leu Arg ser lie ser Asn Leu Arg Ala Leu Thr ser Leu 
835 840 845 

Asp lie ser Asn Asn val Glu Ala Thr ser Leu Pro Glu Glu Met Phe 
850 855 860 

Lys ser Leu Ala Asn Leu Lys Tyr Leu Asn He ser Phe Phe Arg Asn 
B65 870 875 880 

Leu Lys Glu Leu Pro Thr ser Leu Ala Ser Leu Asn Ala Leu Lys Ser 
885 890 895 

Leu Lys Phe Glu Phe Cys Asn Ala Leu Glu Ser Leu Pro Ala Glu Gly 
900 905 910 

Val Lys Gly Leu Thr Ser Leu Thr Glu Leu Ser Val ser Asn Cys Met 
915 920 925 

Met Leu Lys cys Leu Pro Glu Gly Leu Gin His Leu Thr Ala Leu Thr 
930 935 940 

Thr Leu Thr lie Thr Gin cys Pro He val Phe Lys Arg cys Glu Arq 
945 950 955 965 

Gly He Gly Glu Asp Trp His Lys He Ala His He Pro Tyr Leu Thr 
965 970 975 

Leu Tyr Glu 



<210> 43 
<211> 992 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: alignment 
RGCl-blb 

<220> 

<221> SITE 
<222> CD.. (992) 

<400> 43 

Met Ala Glu Ala phe Leu Gin val Leu Leu Asp Asn Leu Thr Phe Phe 
1 5 10 15 

He Gin Gly Glu Leu Gly Leu val phe Gly Phe Glu Lys Glu Phe Lys 
20 25 30 

Lys Leu Ser ser Met Phe Ser Met He Gin Ala val Leu Glu Asp Ala 
35 40 45 

Gin Glu Lys Gin Leu Lys Tyr Lys Ala He Lys Asn Trp Leu Gin Lys 
50 55 60 
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Leu Asn va"l 
65 

Thr Glu Ala 

Arg Thr He 

Met Glu Lys 
115 

Asp Glu Arg 
130 

val Leu Thr 
145 

lie val Lys 

val Leu Pro 

Gin Met val 
195 

He Trp val 
210 

Ala lie val 
225 

Ala Pro Leu 

phe Leu val 

Asn Leu Arg 
275 

He Thr Thr 
290 

Leu Tyr Gin 
305 

Lys Gin Arg 
Glu lie Gly 



Ala Ala 

Ala Arg 
85 

Thr Phe 
100 

Leu Asp 

He lie 

Glu Pro 

He Leu 
165 

lie Leu 
180 

Phe Asn 
Cys val 
Glu ser 



Gin Lys 
245 

Leu Asp 
260 



Ala Lys Thr 
355 

Trp Glu His 
370 

Asn Ser val 
385 

Asp Leu Arg 

Lys He Glu 



Ala val 

Arg Leu 

Leu Ser 

Ala Phe 
325 

Lys Glu 
340 

Leu Gly 
val Arg 
Leu Pro 



Tyr Glu 
70 

Phe Lys 

Cys Tyr 

Ala He 

Glu Arg 
135 

Lys val 
150 

lie Asn 

Gly Met 

Asp Gin 

Ser Asp 
215 

He Glu 
230 

Lys Leu 
Asp val 
Leu Lys 



val Asp Asp He Leu Asp Asp 
75 

Gin Ala val Leu Gly Arg Tyr 
90 

Lys val Gly Lys Arg wet Lys 
105 HO 

Ala Glu Glu Arg Arg Asn Phe 
120 125 

Gin Ala Ala Arg Arg Gin Thr Gly Phe 
140 



cys Lys 
80 

His Pro 
95 

Glu Met 



His Leu 



Tyr Gly Arg Glu Lys Glu Glu 
155 

Asn val Ser Tyr ser Glu Glu 
170 

Gly Gly Leu Gly Lys Thr Thr 
185 190 

Arq lie Thr Glu His Phe Asn 
200 205 

Asp Phe Asp Glu Lys Arg Leu 
220 

Gly Lys ser Leu Gly Asp Met 
235 

Gin Glu Leu Leu Asn Gly Lys 
250 

Trp Asn Glu Asp Gin Glu Lys 
265 270 

lie Gly Ala ser Gly Ala ser 
280 285 

Glu Lys lie Gly Ser He Met Gly Thr 
295 300 

Asn Leu Ser Gin Glu Asp Cys Trp Leu 
310 315 



Asp Glu 
160 

val Pro 
175 

Leu Ala 

Leu Lys 

lie Lys 

Asp Leu 
240 

Arq Tyr 
255 

Trp Asp 
lie Leu 
Leu Gin 



Gin Cys 
405 

Lys Glu 
420 



Cys His Gin Thr Glu Thr ser Pro Lys 
330 

lie val Lys Lys Cys Gly Gly val Pro 

345 350 

Gly Leu Leu Arg Phe Lys Arg Glu Glu 
360 365 

Asp Ser Glu lie Trp Asn Leu Pro Gin 
375 380 

Ala Leu Arg Leu Ser Tyr His His Leu 
390 395 

Phe Ala Tyr Cys Ala val Phe Pro Lys 
410 

Tyr Leu lie Ala Leu Trp Met Ala His 

y 42 5 430 



Leu Phe 
320 

Leu Met 
335 



Leu Ala 
Ser Glu 
Asp Glu 



Pro Leu 
400 

Asp Thr 
415 

Ser Phe 
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Leu Leu ser Lys Gly Asn Met Glu Leu Glu Asp val Gly Asn Glu val 
415 440 445 

Trp Asn Glu Leu Tyr Leu Arg Ser Phe Phe Gin Glu Ile Glu val Lys 
450 455 460 

Ser Gly Lys Thr Tyr Phe Lys Met His Asp Leu lie His Asp Leu Ala 
465 470 475 480 

Thr ser Met Phe Ser Ala Ser Ala Ser ser Arg Ser Ile Arg Gin Ile 
485 490 495 

Asn val Lys Asp Asp Glu Asp Met Met Phe lie Val Thr Asn Tyr Lys 
500 505 510 

Asp Met Met ser lie Gly Phe Ser Glu val val Ser Ser Tyr ser Pro 
515 520 525 

Ser Leu Phe Lys Arg Phe val Ser Leu Arg val Leu Asn Leu ser Asn 
530 535 540 

Ser Glu phe Glu Gin Leu Pro Ser Ser Val Gly Asp Leu Val His Leu 
54 5 550 555 560 

Arg Tyr Leu Asp Leu Ser Gly Asn Lys Ile cys Ser Leu Pro Lys Arg 
565 570 575 

Leu Cys Lys Leu Gin Asn Leu Gin Thr Leu Asp Leu Tyr Asn Cys Gin 
580 585 590 

Ser Leu ser Cys Leu Pro Lys Gin Thr ser Lys Leu Cys Ser Leu Arq 
595 600 605 

Asn Leu val Leu Asp His Cys Pro Leu Thr ser Met Pro Pro Arg ile 
610 615 620 

Gly Leu Leu Thr Cys Leu Lys Thr Leu Gly Tyr Phe val val Gly Glu 
625 630 635 640 

Arg Lys Gly Tyr Gin Leu Gly Glu Leu Arg Asn Leu Asn Leu Arg Gly 
645 650 65 5 

Ala Ile Ser He Thr His Leu Glu Arg val Lys Asn Asp Met Glu Ala 
660 665 670 

Lys Glu Ala Asn Leu Ser Ala Lys Ala Asn Leu His Ser Leu Ser Met 
675 680 685 

Ser Trp Asp Arg Pro Asn Arg Tyr Glu Ser Glu Glu val Lys val Leu 
690 695 700 

Glu Ala Leu Lys Pro His Pro Asn Leu Lys Tyr Leu Glu He He asd 
705 710 715 720 

Phe cys Gly Phe Cys Leu Pro Asp Trp Met Asn His Ser val Leu Lys 
725 730 735 

Asn val val Ser Ile Leu lie Ser Gly cys Glu Asn cys Ser Cys Leu 
740 745 750 

Pro Pro Phe Gly Glu Leu Pro cys Leu Glu ser Leu Glu Leu Gin Asp 
755 760 765 

Gly Ser val Glu val Glu Tyr val Glu Asp Ser Gly Phe Leu Thr Arq 
770 775 780 

Arg Arg Phe Pro Ser Leu Arg Lys Leu His Ile Gly Gly Phe cys Asn 
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785 790 795 800 

Leu Lys Gly Leu Gin Arg Met Lys Gly Ala Glu Gin Phe Pro val Leu 
805 810 815 

Glu Glu Met Lys He Sen Asp Cys Pro Met Phe Val Pne Pro Thr Leu 
820 825 830 

Ser ser val Lys Lys Leu Glu He Trp Gly Glu Ala Asp Ala Gly Gly 
835 840 845 

Leu ser Ser lie Ser Asn Leu ser Thr Leu Thr Ser Leu Lys lie Phe 
850 855 860 

Ser Asn His Thr val Thr ser Leu Leu Glu Glu Met Phe Lys Asn Leu 
865 870 875 880 

Glu Asn Leu He Tyr Leu Ser val Ser Phe Leu Glu Asn Leu Lys Glu 
885 890 895 

Leu Pro Thr ser Leu Ala ser Leu Asn Asn Leu Lys Cys Leu Asp He 
900 905 910 

Ara Tyr Cys Tyr Ala Leu Glu Ser Leu Pro Glu Glu Gly Leu Glu Gly 
915 920 925 

Leu Ser ser Leu Thr Glu Leu Phe val Glu His Cys Asn Met Leu Lys 
930 935 940 

Cvs Leu Pro Glu Gly Leu Gin His Leu Thr Thr Leu Thr Ser Leu Lys 
945 950 955 960 

He Arg Gly Cys Pro Gin Leu He Lys Arg Cys Glu Lys Gly lie Gly 
965 970 975 

Glu Asp Trp His Lys He Ser His He Pro Asn val Asn He Tyr lie 
980 985 990 

<210> 44 
<211> 1040 
<212> PRT 

<213> Artificial Sequence 
<220> .... 

<223> Description of Artificial Sequence: alignment 
RGC4-blb 

<220> 

<221> SITE 

<222> CD • - (1040) 

<400> 44 

Met Ala Glu Ala Phe Leu Gin val Leu Leu Glu Asn Leu Thr Ser Phe 
15 10 15 

He Gly Asp Lys Leu val Leu lie Phe Gly Phe Glu Lys Glu cys Glu 
20 25 30 

lvs Leu ser ser val Phe ser Thr He Gin Ala val Leu Gin Asp Ala 
35 40 45 

Gin Glu Lys Gin Leu Lys Asp Lys Ala lie Glu Asn Trp Leu Gin Lys 
50 55 60 

Leu Asn ser Ala Ala Tyr Glu val Asp Asp He Leu Gly Glu cys Lys 
65 70 75 80 
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Asn 


Glu 


Ala 


lie 


Arg 
85 


Phe 


Glu 


Gin 


Ser 


Arg 
90 


Leu 


Gly 


Phe 


Tyr 


His 
95 


Pro 


5 


Gly 


lie 


lie 


Asn 
100 


Phe Arg His Lys lie Gly Arg Arg Met Lys Glu 
105 110 


He 




Met 


Glu 


Lys 
115 


Leu 


ASp 


Ala 


lie 


Ser 
120 


Glu 


Glu 


Arg Arg 


Lys 
125 


Phe 


His 


Phe 


10 


Leu 


Glu 
130 


Lys 


lie 


Thr 


Glu 


Arg 
135 


Gin Ala Ala 


Ala Ala 
140 


Thr 


Arg 


Glu 


Thr 




val 
145 


Gly Trp 


Gin Trp Gly Trp Ala Arg 
150 


Leu 


Glu 
155 


Tyr 


Lys 


Arg 


Leu 


Leu 

•160 


15 


Leu 


Gly val 


Leu 


Met 
165 


Arg 


lie 


Met 


Ser 


Leu 
170 


Arg 


Met 


His 


val 


Ser 
175 


Thr 




cys 


Ser 


Thr 


Leu 
180 


Tyr 


Glu 


Phe 


Lys 


Phe 
185 


Tyr 


Leu 


cys 


Thr 


pro 
190 


Lys 


val 


20 


Gly 


Ala 


Arg 
195 


Arg 


cys 


Phe 


val 


Leu 
200 


Thr 


Glu 


Pro 


Lys 


Val 
205 


Tyr Gly Arg 




ASp 


Lys 
210 


Glu 


Glu 


ASP 


Glu 


He 
215 


val 


Lys 


He 


Leu 


lie Asn 
220 


Asn 


val 


Asn 


25 


val 

225 


Ala 


Glu 


Glu 


Leu 


Pro 
230 


val 


Phe 


Pro 


He 


lie Gly Met 
235 


Gly 


Gly Leu 
240 




Gly 


Lys 


Thr 


Thr 


Leu 
245 


Ala 


Gin 


Met 


He 


Phe 
250 


Asn 


ASp 


Glu 


Arg 


val 
255 


Thr 


30 


Lys 


His 


Phe 


Asn 
260 


Pro 


Lys 


He 


Trp 


val 
265 


cys 


val 


ser 


ASp 


Asp 
270 


Phe 


Asp 




Glu 


Lys 


Arg 
275 


Leu 


He 


Lys 


Thr 


He 
280 


He Gly Asn 


He 


Glu 
285 


Arg 


ser 


Ser 


35 


Pro 


His 
290 


val 


Glu 


Asp 


Leu 


Ala 
295 


Ser 


Phe 


Gin 


Lys 


Lys 
300 


Leu 


Gin 


Glu 


Leu 




Leu 
305 


Asn Gly 


Lys 


Arg 


Tyr 
310 


Leu 


Leu 


val 


Leu 


ASp 

315 


Asp 


val 


Trp 


Asn 


Asp 
320 


40 


ASp 


Leu 


Glu 


Lys 


Trp 
325 


Ala 


Lys 


Leu 


Arg 


Ala 
330 


val 


Leu 


Thr 


val 


Gly Ala 
335 




Arg 


Gly Ala 


Ser 
340 


He 


Leu 


Ala 


Thr 


Thr 

345 


Arg 


Leu 


Glu 


Lys 


val 
350 


Gly 


Ser 


45 


lie 


Met 


Gly 
355 


Thr 


Leu 


Gin 


Pro Tyr 
360 


His 


Leu 


Ser 


Asn 


Leu 
365 


Ser 


Pro 


His 




ASP 


Ser 
370 


Leu 


Leu 


Leu 


Phe 


Met 
375 


Gin 


Arg 


Ala 


Phe Gly Gin 
380 


Gin 


Lys 


Glu 


50 


Ala 
385 


Asn 


Pro 


Asn 


Leu 


val 
390 


Ala 


lie 


Gly 


Lys 


Glu 
395 


He 


val 


Lys 


Lys 


cys 
400 


Gly 


Gly val 


Pro 


Leu Ala Ala Lys Thr Leu Gly Gly Leu 
405 410 


Leu 


Arg 
415 


Phe 




Lys 


Arg 


Glu 


Glu 
420 


Ser 


Glu 


Trp 


Glu 


His 
425 


val 


Arg 


ASp 


Asn 


Glu 
430 


He 


Trp 


55 


Ser 


Leu 


Pro 
435 


Gin 


Asp 


Glu 


Ser 


Ser 
440 


He 


Leu 


pro Ala 


Leu 

445 


Arg 


Leu 


Ser 
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Tvr His His Leu Pro Leu Asp Leu Arg Gin Cys Phe Ala Tyr cys Ala 
y 4S0 455 460 

val Phe pro Lys Asp Thr Lys Met lie Lys Glu Asn Leu He Thr Leu 
465 470 475 480 

Trp Met Ala His Gly Phe Leu Leu Ser Lys Gly Asn Leu Glu Leu Glu 
485 490 495 

asp val Gly Asn Glu Val Trp Asn Glu Leu Tyr Leu Arg Ser Phe phe 
500 505 510 

Gin Glu lie Glu Ala Lys Ser Gly Asn Thr Tyr Phe Lys lie His Asp 
515 520 525 

Leu He His Asp Leu Ala Thr ser Leu Phe ser Ala ser Ala Ser Cys 
530 535 540 

Gly Asn lie Arg Glu lie Asn val Lys Asp Tyr Lys His Thr val Ser 
545 550 555 560 

lie Gly Phe Ala Ala val val Ser Ser Tyr Ser Pro ser Leu Leu Lys 
565 570 575 

lvs Phe val Ser Leu Arg Val Leu Asn Leu Ser Tyr ser Lys Leu Glu 
y 580 585 590 

Gin Leu Pro Ser Ser lie Gly Asp Leu Leu His Leu Arg Tyr Leu Asp 
595 600 605 

Leu Ser Cys Asn Asn Phe Arg Ser Leu pro Glu Arg Leu cys Lys Leu 
610 615 620 

Gin Asn Leu Gin Thr Leu Asp val His Asn Cys Tyr Ser Leu Asn cys 
625 630 635 640 

Leu Pro Lys Gin Thr ser Lys Leu ser Ser Leu Arg His Leu val val 
64 5 650 655 

Asp Gly Cys Pro Leu Thr Ser Thr Pro Pro Arg lie Gly Leu Leu Thr 
660 665 670 

Cys Leu Lys Thr Leu Gly Phe Phe lie val Gly Ser Lys Lys Gly Tyr 
67 5 680 685 

Gin Leu Gly Glu Leu Lys Asn Leu Asn Leu Cys Gly ser lie Ser lie 
690 695 700 

Thr His Leu Glu Arg val Lys Asn Asp Thr Asp Ala Glu Ala Asn Leu 
705 710 715 720 

ser Ala Lys Ala Asn Leu Gin ser Leu ser Met Ser Trp Asp Asn Asp 
725 730 735 

Gly Pro Asn Arg Tyr Glu Ser Lys Glu val Lys val Leu Glu Ala Leu 
y 740 745 



750 



Lys Pro His Pro Asn Leu Lys Tyr Leu Glu lie lie Ala Phe Gly Gly 
755 760 765 



Phe Arg Phe Pro ser Trp lie Asn His Ser Val Leu Glu Lys val lie 
770 775 780 

Ser val Arg lie Lys ser Cys Lys Asn Cys Leu cys Leu Pro Pro Phe 

785 " 790 795 800 

Gly Glu Leu Pro Cys Leu Glu Asn Leu Glu Leu Gin Asn Gly Ser Ala 
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805 810 815 

Glu val Glu Tyr val Glu Glu Asp Asp val His Ser Arg Phe Ser Thr 
820 825 830 

Arg Arg ser phe Pro ser Leu Lys Lys Leu Arg lie Trp Phe Phe Aro 
83 5 840 845 

Ser Leu. Lys Gly Leu Met Lys Glu Glu Gly Glu Glu Lys Phe Pro Met 
850 855 860 

Leu Glu Glu Met Ala He Leu Tyr cys Pro Leu Phe val Phe Pro Thr 
865 870 875 880 

Leu Ser ser val Lys Lys Leu Glu val His Gly Asn Thr Asn Thr Arg 
885 890 895 

Gly Leu Ser Ser lie Ser Asn Leu Ser Thr Leu Thr ser Leu Arg lie 
900 905 910 

Gly Ala Asn Tyr Arg Ala Thr Ser Leu Pro Glu Glu Met Phe Thr Ser 
20 915 920 925 

Leu Thr Asn Leu Glu Phe Leu Ser Phe Phe Asp Phe Lys Asn Leu Lys 
930 935 940 



w 



15 



25 



30 



35 



40 



Asp Leu Pro Thr Ser Leu Thr ser Leu Asn Ala Leu Lys Arg Leu Gin 
945 950 955 960 

lie Glu Ser Cys Asp ser Leu Glu Ser Phe Pro Glu Gin Gly Leu Glu 
965 970 975 

Gly Leu Thr Ser Leu Thr Gin Leu Phe Val Lys Tyr Cys Lys Met Leu 
980 985 990 

Lys cys Leu Pro Glu Gly Leu Gin His Leu Thr Ala Leu Thr Asn Leu 
995 1000 1005 

Gly val ser Gly Cys Pro Glu val Glu Lys Arg cys Asp Lys Glu He 
1010 1015 1020 

Gly Glu Asp Trp His Lys lie Ala His lie Pro Asn Leu Asp He His 
1025 1030 1035 1040 



Claims 

1 . An isolated or recombinant nucleic acid or functional fragment thereof corresponding to one of a cluster of genes 
45 identifiable by phylogenetic tree analyses as corresponding to the Rpi-bJb, RGC1-btb, RGC3-blb and RGC4-blb 

cluster of figure 9. 

2. A nucleic acid according to claim 1 said nucleic acid encoding a gene product that is capable of providing a member 
of the Solanacae family with resistance against an oomycete pathogen, or a functional equivalent thereof. 

50 

3. A nucleic acid according to claim 1 or 2 wherein said member of the Solanacae family comprises S. tuberosum. 

4. A nucleic acid according to claim 1 to 3 where said resistance is race non-specific. 

55 5. a nucleic acid according to claim 1 to 4 comprising a sequence as depicted in figure 6 for Rpi-blb or part thereof. 
6. A nucleic acid according to claim 1 to 5 at least comprising a LRR domain. 
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7. A vector comprising a nucleic acid according to anyone of claims 1 to 6. 

8. A host cell comprising a nucleic acid according to anyone of claims 1 to 6 or a vector according to claim 7. 
5 9. A cell according to claim 8 comprising a plant cell. 

10. A cell according to claim 9 wherein said plant comprises a member of the Sofanacae family. 

11. A plant comprising a cell according to anyone of claims 6 to 10. 

10 

12. A part derived from a plant according to claim 11 . 

13. A part according to claim 12 wherein said tuber comprises a potato or said fruit comprises a tomato. 
15 14. Progeny of a plant according to claim 11 . 

15. A proteinaceous substance exhibiting the characteristic of providing at least partial resistance to an oomycete 
infection such as caused by a Phytophthora species when incorporated and expressed in a plant or plant cell. 

20 16. A proteinaceous substance encoded by a nucleic acid according to anyone of claims 1 to 6. 

17. A proteinaceous substance comprising an amino acid sequence as depicted in figure 8 or part thereof. 

18. A binding molecule directed at a substance according to anyone of claims 15 to 17. 

25 

19. A binding molecule according to claim 1 8 comprising an antibody or fragment thereof. 

20. A binding molecule directed at a nucleic acid according to anyone of claim 1 to 6. 
30 21. A binding molecule according to claim 20 comprising a probe or primer. 

22. A binding molecule according to anyone of claims 18 to 21 provided with a label. 

23. A binding molecule according to claim 22 wherein said label comprises an excitable moiety. 

35 

24. Use of a nucleic acid according to anyone of claims 1 to 6 or a vector according to claim 7 or a cell according to 
anyone of claims 8 to 11 or a substance according to anyone of claims 15 to 17 or a binding molecule according 
to anyone of claims 1 8 to 23 in a method for providing a plant or its progeny with resistance against an oomycete 
infection. 

40 

25. Use according to claim 24 wherein said oomycete comprises Phytophthora infestans. 

26. Use according to claim 24 or 25 wherein said plant comprises S. tuberosum. 

*5 27. A method for providing a plant or its progeny with at least partial resistance against an oomycete infection com- 
prising providing said plant or part thereof with a gene or functional fragment thereof comprising a nucleic acid 
corresponding to one of a cluster of genes identifiable by phylogenetic tree analyses as corresponding to the Rpi- 
blb, RCGUblb, RCG3-b/bar\6 RCG4-bfbc\u$ter of figure 9, said nucleic acid encoding a gene product that is capable 
of providing a member of the Sofanacae with resistance against an oomycete fungus, or providing said plant or 

50 part thereof with a nucleic acid according to anyone of claims 1 to 4 or a vector according to claim 5 or a cell 

according to claim 6 or a substance according to anyone of claims 15 to 18. 

28. A method for selecting a plant or plant material or progeny thereof for its susceptibility or resistance to an oomycete 
infection comprising testing at least part of said plant or plant material or progeny thereof for the presence or 
55 absence of a nucleic acid corresponding to one of a cluster of genes identifiable by phylogenetic tree analyses as 

corresponding to the Rpi-btb, RCG1-blb, RCG3-blb and RCG4 -bf bc\usier of figure 9, said nucleic acid encoding a 
gene product that is capable of providing a member of the So/anacaewith resistance against an oomycete fungus. 
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29. A method according to claim 28 comprising contacting at least part of said plant or plant material or progeny thereof 
with a binding molecule according to anyone of claims 19 to 23 and determining the binding of said molecule to 
said part. 

30. A method according to claim 29 wherein said oomycete comprises Phytophthora infestans. 

31 . A method according to claim 28 or 29 wherein said plant comprises S. tuberosum. 

32. An isolated S. bulbocastanum, or part thereof, susceptible to an oomycete infection caused by Phytophthora in- 
festans. 
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Figure 4 
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Figure 5 
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Figure 6A 



1 ATGGCTGAAGCTTTCATTCAAGTTCTGCTAGACAATCTCACTTCTTTCCT 
5 1 CAAAGGGGAACTTGTATTGCTTTTCGGTTTTCAAGATGAGTTCCAAAGGC 
101 TTTC AAGCATGTTTTCTACAATTC AAGCCGTCCTTGAAGATGCTCAGGAG 
151 AAG C AAC TC AAC AAC AAGC CTC TAGAAAATTGGTTGC AAAAACTCAATGC 

2 01 TGCTACATATGAAGTCGATGACATCTTGGATGAATATAAAACCAAGGCCA 
251 CAAGATTCTCCCAGTCTGAATATGGCCGTTATCATCCAAAGGTTATCCCT 
301 TTCCGTC AC AAGGTCGGGAAAAGGATGGACC7VAGTGATGAAAAAACTAAA 

3 51 GGCAATTGCTGAGGAAAGAAAGAATTTTCATTTGCACGAAAAAATTGTAG 

4 01 AG AGAC AAGCTGTTAGACGGG AAACAG GTTCTGTATT AAC CG AA CCGC AG 
4 51 GTTTATGGAAGAGACAAAGAGAAAGATGAGATAGTGAAAATCCTAATAAA 
501 C AATGTTAGTGATGCCCAACACCTTTC AGTCCTCCCAATACTTGGTATGG 
551 G G G G A TT AGG AAAAAC G AC TC T TGCC C AAATG GTC TTC AATG AC CAG AG A 
601 GTTACTGAGCATTTCCATTCCAAAATATGGATTTGTGTCTCGGAAGATTT 

6 51 TGATGAG AAGAGGTTAATAAAGGCAATTGTAGAATCTATTGAAGGAAGGC 

7 01 CACTACTTGGTGAGATGGACTTGGCTCCACTTCAAAAGAAGCTTCAGGAG 

7 51 TTGCTGAATGGAAAAAGATACTTGCTTGTCTTAGATGATGTTTGGAATGA 

8 01 AGATCAACAGAAGTGGGCTAATTTAAGAGCAGTCTTGAAGGTTGGAGC AA 
8 51 GTGGTGCTTCTGTTCTAACCACTACTCGTCTTGAAAAGGTTGGATCAATT 
901 ATGGGAACATTGCAACCATATGAACTGTCAAATCTGTCTCAAGAAGATTG 
951 TTGGTTGTTGTTC ATGCAACGTGC ATTTGGACACCAAGAAGAAATAAATC 
10 01 C AAACCTTGTGGC AATCGGAAAGG AGATTGTGAAAAAAAGTGGTGGTGTG 
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1051 CCTCTAGCAGCCAAAACTCTTGGAGGTATTTTGTGCTTCAAGAGAGAAGA 

1101 AAGAGC ATGGGAACATGTGAG AGAC AGTCCG ATTTGGAATTTGCCTC AAG 

1151 ATGAAAGTTCTATTCTGCCTGCCCTGAGGCTTAGTTACCATCAACTTCCA 

12 0 1 CTTGATTTGAAACAATGCTTTGCGTATTGTGCGGTGTTCCCAAAGGATGC 

12 51 c AAAATGGAAAAAG AAAAGCTAATCTCTCTCTGGATGGCGC ATGGTTTTC 

13 01 TTTTATC AAAAGG AAACATGGAGCT AGAGGATGTGGGCGATGAAGTATGG 

13 51 AAAGAATTATACTTGAGGTCTTTTTTCCAAGAGATTGAAGTTAAAGATGG 

14 01 T AAAACTTATTTC AAG ATGCATGATCTCATCCATGATTTGGCAACATCTC 

14 51 TGTTTTCAGCAAAC ACATC AAGC AGCAATATCCGTGAAATAAATAAACAC 

15 01 AGTTACACACATATGATGTCCATTGGTTTCGCCGAAGTGGTGTTTTTTTA 

15 51 CACTCTTCCCCCCTTGGAAAAGTTTATCTCGTTAAGAGTGCTTAATCTAG 

16 01 GTGATTCGACATTTAATAAGTTACCATCTTCCATTGGAGATCTAGTACAT 
1651 TTAAGATACTTGAACCTGTATGGCAGTGGCATGCGTAGTCTTCCAAAGC A 

17 01 GTTATGCAAGCTTCAAAATCTGCAAACTCTTGATCTACAATATTGCACCA 

17 51 AGCTTTGTTGTTTGCC AAAAGAAACAAGTAAACTTGGTAGTCTCCGAAAT 
1801 CTTTTACTTGATGGTAGCCAGTCATTGACTTGTATGCCACCAAGGATAGG 

18 51 ATCATTGACATGCCTTAAGACTCTAGGTCAATTTGTTGTTGGAAGGAAGA 

19 01 AAGGTTATCAACTTGGTGAACTAGGAAACCTAAATCTCTATGGCTCAATT 
1951 AAAATCTCGCATCTTGAGAGAGTGAAGAATGATAAGGACGCAAAAGAAGC 
2 0 01 CAATTTATCTGCAAAAGGGAATCTGCATTCTTTAAGCATGAGTTGGAATA 
2 0 51 ACTTTGGACCACATATATATGAATCAGAAGAAGTTAAAGTGCTTGAAGCC 
2101 CTC AAACCACACTCCAATCTGACTTCTTTAAAAATCTATGGCTTCAGAGG 
2151 AATCCATCTCCC AGAGTGGATGAATCACTCAGTATTGAAAAATATTGTCT 
22 01 CTATTCTAATTAGCAACTTC AGAAACTGCTCATGCTTACCACCCTTTGGT 
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2251 G ATCTGCCTTGTCTAGAAAGTCTAGAGTTACACTGGGGGTCTGCGGATGT 

2 3 01 GG AGTATGTTGAAGAAGTGGATATTGATGTTCATTCTGG ATTCCCCACAA 

2351 G AATAAGGTTTCC ATCCTTG AGGAAACTTGATATATGGGACTTTGGT AGT 

2401 CTGAAAGG ATTGCTGAAAAAGGAAGGAGAAGAGC AATTCCCTGTGCTTGA 

2451 AGAGATGATAATTCACGAGTGCCCTTTTCTGACCCTTTCTTCTAATCTTA 

2 5 01 GGGCTCTTACTTCCCTCAGAATTTGCTATAATAAAGTAGCTACTTCATTC 

25 51 CCAGAAGAGATGTTCAAAAACCTTGCAAATCTCAAATACTTGACAATCTC 

2 6 01 TCGGTGCAATAATCTCAAAGAGCTGCCTACCAGCTTGGCTAGTCTGAATG 

2 651. CTTTGAAAAGTCTAAAAATTCAATTGTGTTGCGCACTAGAGAGTCTCCCT 

27 01 G AGGAAGGGCTGG AAGGTTTATCTTCACTCACAGAGTTATTTGTTGAACA 

2 7 51 CTGTAACATGCTAAAATGTTTACC AGAGGGATTGCAGCACCTAACAACCC 

2801 TC ACAAGTTTAAAAATTCGGGGATGTCC ACAACTGATC AAGCGGTGTGAG 

2 851 AAGGGAATAGGAGAAGACTGGC ACAAAATTTCTCACATTCCTAATGTG AA 

2901 TATATATATTTAA 
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Figure 6B 
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ATGGCTGAAGCTTTCATTCAAGTTCTGCTAGACAATCTCACTTCTTTCCT 
CAAAGGGGAACTTGTATTGCTTTTCGGTTTTCAAGATGAGTTCCAAAGGC 
TTTCAAGCATGTTTTCTACAATTCAAGCCGTCCTTGAAGATGCTCAGGAG 
AAGCAACTCAACAACAAGCCTCTAGAAAATTGGTTGCAAAAACTCAATGC 
TGCTACATATGAAGTCGATGACATCTTGGATGAATATAAAACCAAGGCCA 
251 CAAGATTCTCCC AGTCTGAATATGGCCGTTATCATCC AAAGGTTATCCCT 

301 TTCCGTCACAAGGTCGGGAAAAGGATGGACCAAGTGATGAAAAAACTAAA 
3 51 GGCAATTGCTGAGG AAAGAAAGAATTTTC ATTTGCACGAAAAAATTGTAG 

AGAGACAAGCTGTTAGACGGGAAACAGGTACTCATCTTAAATTAGTATTA 
CAACAACTAAGTTTATATTCATTTTTTTGGCAATTA'TCAAATTCAGAAAA 
GGGTTAAATATACTCATGTCCTATCGTAAATAGTGTATATATACCTCTCG 
TTGTACTTTCGATCTGAATATACTTGTCAAATCTGGCAAGCTCAGAATCA 
AATTATCCACCCCAACTTTTAAATACTCGATATCTTTAGAAATCCACCTG 
TCTAACTCATCCACTACCCATTCCCTTTGCTTTGAATTCTTTTCTTTACC 
7 01 TATAAACTTGGAACACTCGATCCGTTTTGCTTTTCTTAACAAAGCAGCTC 
7 51 AGAGAAAAGAGGTTTTCTTCTATTCTGTTTCTCTGTGTGCTGCACTTGGG 
801 TCC TTAATC CCATTAAAAACAGGGCATGTTAATCCCAACGACGGTAGCCT 

TTCCTGACAGCTGACTGTAAATTTTGTCTAACAAAGAAAAAAAAAGATTA 
GACATGTTTTTCCTTGTCATTGATTAGGCTGGATTTCTTTCAGAGTGGAA 
CATAGGGGATATATTGGACCAAAAGTAGAATGGGTATATATTTAAAGTAT 
1001 TTCTGATAGAACAGGAGTATATTGTGCGAAAATATCCTCTATTTTCXGTT 
1051 GTCTCCTAATGAGTTTGAATGTAATAATATTCTCATGTGGACATTGCTTG 
CACC AGGTTC TGT ATTAAC CGAAC C GC AGGTTT ATGGAAGAGAC AAAG AG 
AAAGATGAGATAGTGAAAATCCTAATAAACAATGTTAGTGATGCCCAACA 
CCTTTCAGTCCTCCCAATACTTGGTATGGGGGGATTAGGAAAAACGACTC 
TTGCCCAAATGGTCTTCAATGACCAGAGAGTTACTGAGCATTTCCATTCC 
AAAAT ATG G ATTTGTGTC TCGG AAGATTTTGATG AGAAG AGGTT AAT AAA 
GGCAATTGTAGAATCTATTGAAGGAAGGCCACTACTTGGTGAGATGGACT 
TGGCTCCACTTCAAAAGAAGCTTCAGGAGTTGCTGAATGGAAAAAGATAC 
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1451 TTGCTTGTCTTAG ATG ATGTTTGGAATGAAGATCAAC AGAAGTGGGCTAA 

15 01 TTTAAGAGC AGTCTTGAAGGTTGGAGCAAGTGGTGCTTCTGTTCTAACCA 

15 51 CTACTCGTCTTG AAAAGGTTGGATCAATT ATGGGAAC ATTGCAACCATAT 

16 01 GAACTGTCAAATCTGTCTC AAGAAGATTGTTGGTTGTTGTTCATGCAACG 

16 51- TGCATTTGGACACCAAGAAGAAATAAATCCAAACCTTGTGGCAATCGGAA 

17 01 AGG AGATTGTGAAAAAAAGTGGTGGTGTGCCTCTAGC AGCC AAAACTCTT 
1751 GG AGGT ATTTTGTGCTTC.AAG AGAGAAG AAAGAGC ATGGGAACATGTGAG 
1801 AGACAGTCCGATTTGGAATTTGCCTCAAGATGAAAGTTCTATTCTGCCTG 

18 51 CCCTGAGGCTTAGTTACCATC AACTTCC ACTTG ATTTG AAACAATGCTTT 

19 01 GCGTATTGTGCGGTGTTCCC AAAGGATGCCAAAATGGAAAAAGAAAAGCT 

19 51 AATCTCTCTCTGGATGGCGCATGGTTTTCTTTTATCAAAAGGAAACATGG 
2001 AGCTAGAGGATGTGGGCGATGAAGTATGGAAAGAATTATACTTGAGGTCT 

20 51 TTTTTCCAAGAGATTGAAGTTAAAGATGGTAAAACTTATTTCAAGATGCA 
2101 TG ATCTCATCC ATGATTTGGC AAC ATCTCTGTTTTCAGC AAACAC ATC AA 
2151 GCAGCAATATCCGTGAAATAAATAAACACAGTTACACACATATGATGTCC 
22 01 ATTGGTTTCGCCGAAGTGGTGTTTTTTTACACTCTTCCCCCCTTGGAAAA 

22 51 GTTTATCTCGTTAAGAGTGCTTAATCTAGGTGATTCGACATTTAATAAGT 

23 01 TACC ATCTTCCATTGGAG AT C TAGTAC ATTTAAGAT ACTTG AACCTG TAT 
2 351 GGCAGTGGCATGCGTAGTCTTCCAAAGCAGTTATGC AAGCTTCAAAATCT 
2 4 01 GCAAACTCTTGATCTACAATATTGCACCAAGCTTTGTTGTTTGCCAAAAG 
2451 AAACAAGTAAACTTGGTAGTCTCCGAAATCTTTTACTTGATGGTAGCCAG 
2 5 01 TC ATTGACTTGTATGCCACCAAGGATAGGATCATTGACATGCCTTAAGAC 
2551 TCT AGGTC AATTTGTTGTTGGAAGGAAGAAAGGTTATCAACTTGGTGAAC 
2 6 01 TAGGAAACCTAAATCTCTATGGCTCAATTAAAATCTCGCATCTTGAGAGA 
2 6 51 GTGAAGAATGATAAGGACGC AAAAGAAGCCAATTTATCTGCAAAAGGGAA 
27 01 TCTGCATTCTTTAAGC ATGAGTTGG AATAAGTTTGGACC AC ATATATATG 
2751 AATCAGAAGAAGTTAAAGTGCTTGAAGCCCTCAAACC ACACTCCAATCTG 
2 801 ACTTCTTTAAAAATCTATGGCTTC AGAGGAATCCATCTCCCAGAGTGGAT 
2 8 51 GAATCACTCAGTATTGAAAAATATTGTCTCTATTCTAATTAGCAACTTCA 
2 9 01 GAAACTGCTCATGCTTACCACCCTTTGGTGATCTGCCTTGTCTAGAAAGT 

2 951 CTAGAGTTACACTGGGGGTCTGCGGATGTGGAGTATGTTGAAGAAGTGGA 

3 001 TATTGATGTTCATTCTGG ATTCCCCACAAGAATAAGGTTTCCATCCTTGA 
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30 51 G GAAACTTGATATATGGGACTTTGGTAGTCTGAAAGGATTGCTGAAAAAG 

3101 G AAGGAG AAGAGCAATTCCCTGTGCTTGAAGAGATGATAATTCACGAGTG 

3151 cccTTTTCTGACCCTTTCTTCTAATCTTAGGGCTCTTACTTCCCTCAGAA 

3 201 ttTGCTATAATAAAGTAGCTACTTCATTCCC AGAAGAG ATGTTCAAAAAC 

3251 CTTGCAAATCTCAAATACTTGACAATCTCTCGGTGCAATAATCTCAAAGA 

3 3 01 GCTGCCTACCAGCTTGGCTAGTCTGAATGCTTTGAAAAGTCTAAAAATTC 

3351 AATTGTGTTGCGCACTAGAGAGTCTCCCTGAGGAAGGGCTGGAAGGTTTA 

3401 TCTTCACTC ACAGAGTTATTTGTTGAACACTGTAACATGCT AAAATGTTT 

3 451 ACC AGAGGGATTGCAGCACCTAACAACCCTCACAAGTTTAAAAATTCGGG 

3501 GATGTCC AC AACTGATCAAGCGGTGTGAGAAGGGAATAGGAGAAGACTGG 

3 551 CACAAAATTTCTCACATTCCTAATGTGAATATATATATTTAA 
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Figure 6C 

1 GATCTTTTAAATATTTTGAATTAGCAATTATTGTGACTATAATACTTTTT 
5 1 ACATAATTTGC AAATATATAAAATTTATTTTTTGAAAAAAAGAAGATTTC 
101 ATGCGC AAATTCC AGGTCAAACTTAAATTATTAGACTCTCGAAAAATGAA 
151 AAGTGTC ACATAAATTGACAC AAAGGGAGTACTTGTTAATGTTGTAATTA 

2 01 TTGGCGAACAATAATGTTGTTGATTATCACTTTCTGAATAAATGTTGTGT 

2 51 CACTTGGAAAAAAC ACCAAATAGAACTATTCATGTTTTTTCTTTAGTATA 

3 01 TATAAAT ATGATCTTTAACTTAATTGC AGC AGAC AGGC ATGATCTTTAAC 
3 51 TTTAAATGTGCACAAGTAGATTGACAGGCTTGCTAATTGAGTGTCTGTTA 
401 T AATCAGTATTAATTACTCTC AAGGTAATAGTATATTCCAGACAAATTTT 
451 GTGTTACCAAATTAAATATATTTCTAAAACTCTCCTC AAAGTAGTTAATA 
501 TACTTTTGAGTGTTGTATCATGTTTTTAATATAAAATGTTAAAATTTAGA 
551 TG AAATTTACTTTCTAGTTAAATTGGTCAAAGTTGAAAG AATTTCAAGTG 
601 AAAAAGTTTTTAATAATTTGACTTTTATGCTATATTTTTTT AAAGTTGAA 
651 CGACTTTTTAATAAAAAAGAATAATAAAATTATATGATAATTTTTAT AAT 
701 AC AATGG CCTTT AT ATG ATG AAAAAAAAAG AAAG AAATT AGATG AC AACA 
7 51 ATGTCCAAAAATAATCTTAAAG AATTACG ATTTATATATAATAAAATTAA 
801 ATTTAAAATTTG ATGAAAAAATAGAG AAAAGAGGAAGATGATGAAGTGAA 
851 ATGACGTGGTGGTGGGTCCATGTGAC ATAAAAAAAAATTCTCTT AAATAA 
901 TCCTTTC ATACTAATGATAAATTTTTTTTTTTTTTTTTTTTTTTACTAAT 
951 TGCGTATAGAGAAAAGGAAAATGGGGCGGTAATTACAAAGTAGGGAATCG 
1001 AACTTTATC AAC AAGTTGAGAGTTCAAGTAATCAACC AACTAAACTACTA 
10 51 AAATTTTTCTAATTAATGATAATTGTAATTC ATTTAGCATAAAAAATTTC 
1101 ATTGCACTTACTTTTAGAGTTTTGAAAACAGTACTTCATCTATTCTATAT 
1151 TAATTAAATTTTCTATATTAATTAAATTTGTGAGGTAATACAAACTT ATT 
12 01 AAG AAAAATATTTAAGGACATAATTTAACTCATATTTTTCACTATTGTTT 

12 51 TTTGTG AAATC ATAAATATAACTTTGTAAATAGTGCAATTTATCTCCTAG 

13 01 AAGCAAATTTC ACC AAAGAAAAGGGCAAAGATGGAAAAG AAACTAAATAT 

13 51 TCATCTTAAACTTTGAAC AATTCAATTATTTTG AACAATGAAAAAAATCT 

14 01 C AAAAATTCAATTAATATGAAATGGAGAGAGTAACTTTATTTTAGAGGCA 
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14 51 AAAAATTAGTACTCCATCCGTTCACTTTGATTTGTC ATGTTGCACTTTTC 

15 01 GAAAGTCAATTTGACTAATTTTTAAAGCTAAATTAGATTACACTAATTCA 

15 51 ATATTTTAAAC AGAAAAATTAGATATTCAAAAACTATACAAAAAATATTA 

16 01 TAC ATTGCAATTTTTTGCATATCAATATGATAAAAAAATATATCGTAAAA 

16 51 TATTAGTCAAAATTTTTATAATTTGACTCAAATCATGAAAAGTATAATAA 

17 01- TTAATAGTGG ACGGAGGAAGTATTGTCTTTCCAGATTTGTGGCC ATTTTT 
17 51 GGTCCAAGGGCCATTAGCAGTTCTCTTCATTTTCTACTTCTGTCTCATAT 
1801 TAGATGGGCATCTTACTAAAAATATTTGTCTCATATTACTTGATTATTTA 
1851 TTAAATCAAAAAGAATTAATTAATTTTTTCTCATTTTACCCCTACAATTA 
1901 ■ ATATAGTTTTAAAAGTTTTAAAC AAATTTTG AAGAATCAAAATTTCTTTT 

GCAAGAGACTTATTAATATAAACAAAGGATAAAATAATAAAAGCTGTCAA 
TTTATTGACCATCACTTAATAATATATAAAATACAAACTGCTGATCTAAT 
2051 ATGAGACGGACAAAATATATTCTAAAATATTTTCGG ACAG ATATGTGATA 
2101 TTCTAACCATTCACTACACTATATTATGCATTTTATCCGCCAATGACTTA 
TTTCAGCTTTAATTAATTAGGAAAGAGGAAACTGCCAATGAGGAAGAGTA 
GGGGCGTAGTTGCTGTCGACGAAAAAAAGATAATACTCACTCTTTTCGAT. 
TTTT ATTTTT ATTTATCACTTTTAACCTATCATGTAAAAAGATAATT ATT 
23 01 TTTTTCATGCTTTATCCTTAGTATTAAACAATTTAATAGGGATTATTTTG 
23 51 TAAAATATTTATATGAATAATTGTTTTCGTAATGAATTTGTCCGGTCAAA 
2 401 C AATG ATAAATAAAAATGAATGAAG AGAGTAGAAAACAAAAC AAAAGAAC 
2451 AAGTTGACAACTTG AGAGATTAAAAGGGTCCAAAACGCCTTGGATTTTGA 
2 501 GATTCC ATATGTGAAATTTCC ATG AAATAATTGAATTTGTATTATTACAA 
2551 GTCAAACTTTCC ATTTCATTCCAACTAG CC ATCTTGGTTTCAAAATTACA 
2 601 C ATTCATTCATTCAC AGATCTAATATTCTTAATAGTGATTTCC ACATATG 
2 651 GCTGAAGCTTTCATTCAAGTTCTGCTAGACAATCTCACTTCTTTCCTCAA 
2701 AGGGGAACTTGTATTGCTTTTCGGTTTTCAAGATGAGTTCCAAAGGCTTT 
27 51 CAAGCATGTTTTCTACAATTCAAGCCGTCCTTGAAGATGCTCAGGAGAAG 
2 801 c AACTCAAC AACAAGCCTCTAGAAAATTGGTTGCAAAAACTCAATGCTGC 
2851 TAC ATATGAAGTCGATGAC ATCTTGGATGAATATAAAACCAAGGCC ACAA 

2 901 G ATTCTCCCAGTCTGAATATGGCCGTTATCATCCAAAGGTTATCCCTTTC 
29 51 CGTCAC AAGGTCGGGAAAAGGATGG ACCAAGTGATGAAAAAACTAAAGGC 

3 001 AATTGCTGAGGAAAGAAAGAATTTTCATTTGC ACGAAAAAATTGTAGAG A 
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3 051 G AC AAGCTGTTAGACGGGAAACAGGTACTCATCTTAAATTAGTATTACAA 

3101 CAACTAAGTTTATATTCATTTTTTTGGCAATTATCAAATTCAGAAAAGGG 

3151 TTAAATATACTCATGTCCTATCGTAAATAGTGTATATATACCTCTCGTTG 

3 2 01 TACTTTCGATCTGAATATACTTGTCAAATCTGGCAAGCTCAGAATCAAAT 

3 251 TATCCACCCCAACTTTTAAATACTCGATATCTTTAGAAATCCACCTGTCT 

3 3 01 AACTCATCCACTACCCATTCCCTTTGCTTTGAATTCTTTTCTTTACCTAT 

3 3 51 AAACTTGGAACACTCGATCCGTTTTGCTTTTCTTAACAAAGCAGCTC AGA 

3 4 01 GAAAAGAGGTTTTCTTCTATTCTGTTTCTCTGTGTGCTGCACTTGGGTCC 

3 4 51 TTAATCCCATTAAAAACAGGGCATGTTAATCCCAACGACGGTAGCCTTTC 

3 5 01 C TG AC AGC TG AC TGT AAATTTTGT CT AAC AAAG AAAAAAAAAG ATT AG AC 

3 551 ATGTTTTTCCTTGTCATTGATTAGGCTGGATTTCTTTCAGAGTGGAACAT 

3 6 01 AGGGGAT ATATTGGACCAAAAGTAGAATGGGTATATATTTAAAGTATTTC 

3 6 51 TGATAGAACAGGAGTATATTGTGCGAAAATATCCTCTATTTTCTGTTGTC 

37 01 TCCTAATGAGTTTGAATGTAATAATATTCTCATGTGGACATTGCTTGCAC 

37 51 C AGGTTCTGTATTAACCGAACCGCAGGTTTATGGAAGAGACAAAGAGAAA 

3 8 01 GATGAGATAGTGAAAATCCTAATAAACAATGTTAGTGATGCCCAACACCT 

3 851 TTCAGTCCTCCC AATACTTGGTATGGGGGGATTAGGAAAAACGACTCTTG 

3 901 CCCAAATGGTCTTCAATGACCAGAGAGTTACTGAGCATTTCCATTCCAAA 
3951 ATATGGATTTGTGTCTCGGAAGATTTTGATGAGAAGAGGTTAATAAAGGC 
40 01 AATTGTAGAATCT ATTGAAGGAAGGCCACTACTTGGTGAGATGGACTTGG 
4051 CTCC ACTTC AAAAGAAGCTTC AGG AGTTGCTG AATGG AAAAAGATAC TTG 
4101 CTTGTCTTAGATGATGTTTGGAATGAAGATCAACAGAAGTGGGCTAATTT 
4151 AAGAGCAGTCTTG AAGGTTGGAGC AAGTGGTGCTTCTGTTCTAACCACTA 
42 01 CTCGTCTTGAAAAGGTTGGATCAATTATGGGAACATTGC AACCATATGAA 

4 2 51 CTGTC AAATCTGTCTCAAGAAGATTGTTGGTTGTTGTTCATGCAACGTGC 
4301 ATTTGGACACCAAGAAGAAATAAATCCAAACCTTGTGGC AATCGGAAAGG 
4 3 51 AGATTGTGAAAAAAAGTGGTGGTGTGCCTCTAGCAGCCAAAACTCTTGGA 
44 01 GGTATTTTGTGCTTCAAGAGAGAAGAAAGAGCATGGGAACATGTGAGAGA 
44 51 C AGTCCGATTTGGAATTTGCCTCAAGATGAAAGTTCTATTCTGCCTGCCC 
4 5 01 TGAGGCTTAGTTACCATCAACTTCCACTTGATTTGAAACAATGCTTTGCG 
4 5 51 TATTGTGCGGTGTTCCCAAAGGATGCC AAAATGGAAAAAGAAAAGCTAAT 
4 6 01 CTCTCTCTGG ATGGCGCATGGTTTTCTTTTATCAAAAGGAAACATGGAGC 
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4651 T AG AGGATGTGGGCGATGAAGTATGGAAAGAATTATACTTGAGGTCTTTT 

4 7 01 TTCCAAGAGATTGAAGTTAAAGATGGTAAAACTTATTTCAAGATGCATGA 

47 51 TCTCATCCATGATTTGGCAACATCTCTGTTTTCAGCAAACACATCAAGCA 
4801 GC AATATCCGTGAAATAAATAAACACAGTTACAC AC ATATGATGTCCATT 

48 51- GGTTTCGCCG AAGTGGTGTTTTTTTACACTCTTCCCCCCTTGGAAAAGTT 

49 01 t ATCTCGTTAAGAGTGCTTAATCTAGGTGATTCG AC ATTTAATAAGTTAC 
4951 C ATCTTCCATTGGAGATCTAGTACATTTAAGATACTTGAACCTGTATGGC 
5001 AGTGGC ATGCGTAGTCTTCCAAAGCAGTTATGCAAGCTTCAAAATCTGCA 
5051 AACTCTTGATCTACAATATTGC ACCAAGCTTTGTTGTTTGCCAAAAG AAA 
5101 C AAGTAAACTTGGTAGTCTCC GAAATCTTTT ACTTG ATGGTAGCC AGTC A 
5151 TTGACTTGTATGCCACCAAGGATAGGATCATTGACATGCCTTAAGACTCT 
5201 AGGTC AATTTGTTGTTGGAAGG AAGAAAGGTTATCAACTTGGTGAACTAG 
5251 GAAACCT AAATCTCTATGGCTCAATTAAAATCTCGC ATCTTGAGAGAGTG 
53 01 AAGAATGATAAGGACGCAAAAGAAGCCAATTTATCTGCAAAAGGGAATCT 
5351 GCATTCTTTAAGCATGAGTTGGAATAACTTTGGACC ACATATATATGAAT 
5401 C AG AAG AAGTTAAAGTGCTTGAAGCCCTCAAACC AC ACTCCAATCTG ACT 
5451 TCTTTAAAAATCTATGGCTTC AGAGGAATCCATCTCCCAGAGTGGATGAA 
5501' TCACTCAGTATTGAAAAATATTGTCTCTATTCTAATTAGCAACTTCAGAA 
5 551 ACTGCTCATGCTTACC ACCCTTTGGTGATCTGCCTTGTCTAGAAAGTCTA 
5601 G AGTTAC ACTGGGGGTCTGCGG ATGTGGAGTATGTTGAAGAAGTGG AT AT 
5651 TGATGTTCATTCTGGATTCCCCACAAGAATAAGGTTTCCATCCTTGAGGA 
57 01 AACTTGATATATGGGACTTTGGTAGTCTGAAAGGATTGCTGAAAAAGGAA 
57 51 GGAGAAGAGC AATTCCCTGTGCTTGAAGAGATGATAATTCACGAGTGCCC 
5801 TTTTCTGACCCTTTCTTCTAATCTTAGGGCTCTTACTTCCCTCAGAATTT 
5 851 GCTATAATAAAGTAGCTACTTCATTCCCAGAAGAGATGTTCAAAAACCTT 
5901 GC AAATCTCAAATACTTGACAATCTCTCGGTGCAATAATCTCAAAGAGCT 
59 51 GCCT AC C AGC TTGG CT AGTC TG AAT GC TTTG AAAAGTCT AAAAATTC AAT 
6001 TGTGTTGCGCACTAGAGAGTCTCCCTGAGGAAGGGCTGGAAGGTTTATCT 
6051 TCACTCACAGAGTTATTTGTTGAACACTGTAACATGCTAAAATGTTT ACC 
6101 AGAGGGATTGCAGCACCTAACAACCCTCACAAGTTTAAAAATTCGGGGAT 
6151 GTCCACAACTGATC AAGCGGTGTGAGAAGGGAATAGGAGAAGACTGGCAC 
62 01 AAAATTTCTC ACATTCCTAATG TGAAT ATATATATTTAAGTTATTTGCTA 
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62 51 TTGTTTCTTTGTTTGTGAGTCTTTTTGGTTCCTGCCATTGTGATTGCATG 

63 01 TAATTTTTTTCTAGGGTTGTTTGTTTGTTGAGTCTCTCTCTCATTGGATG 

63 51 TAATTCTCTTTTGGTAACAAATTAACAATCTATTTGTATTATACGCTTTC 
6401 AGAATCTATTACTTATTTGTAATTGTTTCTTTGTTTGTAAATTGTGAGTA 

64 51 TCTTATTGTATGGAATTTTCTGATTTTATTTTGAAAACAAATCAATAAGA 
6501 . TCCATCTGTATTATACTCCCTTCGTCTCATTTTATGTGACACTTTTTGGA 
6 5 51 TTTCG AGATTCTTTGATCTTAAATTTTTCATAGATCTTTTAAACATTTTG 
6601 AATTATC AATTATTGAGATTTTAGTATTTTTTATGTAGTTTACAAATATA 
6651 TAAAATTAATTTTTTAAAAAAAAGAAGATTTCATGCGCATATTCCCGATC 
6 701 AAACTTAAATTACTAGACTCTCGAAAAATGAAAAGTGTCACATAAATTGA 

67 51 GACAGAGGGAGTACTTGTTAATGTTGTAATTATTGGCGAACAATAATGTT 

68 01 GGTGATTATCACTTTCTGAATAAATGTTGTGTCACGTGGAAAAAACACC A 
6851 AATAGAAGTATTCATGCTTTTTTAGTATATATAAACATGATTTTTAACTT 
6901 GGTTTCAGCGGATAGTCATGACCTTTAACTCTGAATGTGC AC AAGTAGAT 

69 51 ACTTGTATAAAATTAAACAAATTTTATAAAATTATACAATATGACACTGA 
7001 G AGT AATTG AT ACCAATTGCAGTCGTTGCTGCTTTTCG ATTCTCTGTC AT 
7051 TC TCT AGGT AATTG ATTTTAC AG AAAAGGGCCAAAAAT ATCCCTG AAGT A 
7101 C C AG AAAAGGTCTC AAAATACCAAC C ATCCAC ATTTTGGTCTAAAAAT AT 
7151 CCTTCTACTCATCCTTTTTTGTCTAAAATTACCCTTTCATCCACATTTTT 
7201 GCTC ACTTATACCCTTATAACAACTCTCTCCTTTTTTAAAAAAAAATATT 

72 51 TATTATGTGTCATTTTCTTATTGAATGAAATAAAAATCCACCTCTATTAA 

73 01 TTTTTTCCCATAATTTATCCAAATC AAAACAATATATTTTTTC AAGATC 
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Figure 6D 

1 atgGCTGAAGCTTTCCTTCAAGTTCTGCTAGATAATCTCACTTTTTTCAT 

5 1 cCAAGGGGAACTTGGATTGGTTTTTGGTTTCGAGAAGGAGTTTAAAAAAC 

101 ' * TTTCAAGTATGTTTTCAATGATCCAAGCTGTGCTAGAAGATGCTCAAGAG 

151 AAGCAACTGAAGTACAAGGCAATAAAGAACTGGTTACAGAAACTCAATGT 

201 TGCTGC ATATGAAGTTGATGACATCTTGGATGACTGTAAAACTGAGGCAG 

2 51 CAAGATTC AAGC AGGCTGTATTGGGGCGTTATCATCCACGGACCATC ACT 

3 01 TTCTGTTACAAGGTGGGAAAAAGAATGAAAGAAATGATGGAAAAACTAGA 

3 51 TGCAATTGCAGAGGAACGGAGGAATTTTC ATTTAGATGAAAGGATTATAG 

4 01 AGAGACAAGCTGCTAGACGGCAAAC AGGTGCTCATCTTAATTTTATTTTA 
451 AAACAAAT AAGT AT T ACAAATTGC AGAGAAACGAAGGAATTTAT ATTCAT 
501 TTTTATTTTTGGCAATTATCAAAGTCATTTGTGTTTTTAAGCTGGGGGGA 
551 AGTTTCAAATATTTTCTCTAGTCTTAATGTTTGTCTCACTCACTCAGCAT 
601 GATTTTCTCAATCCTTCACTTCAACTCCCCCCTACTGTGCAAATATCTTC 
651 TCTATTTTCTGTTGACTCCTAATGAGCTTGAATGTAACAACATTCTTGTT 
7 01 TGGAGCAGGTTTTGTTTTAACTGAGCCAAAAGTTTATGGAAGGGAAAAAG 
7 51 AGGAGGATGAGATAGTGAAAATCTTGATAAACAATGTTAGTTATTCCGAA 
801 GAAGTTCCAGTACTCCCAATACTTGGTATGGGGGGACTAGGAAAGACGAC 
851 TCTAGCCCAAATGGTCTTCAATGATC AAAGAATTACTGAGCATTTCAATC 
901 TAAAGATATGGGTTTGTGTCTCAGATGATTTTGATGAGAAGAGGTTGATT 
951 AAGGCAATTGTAGAATCTATTGAAGGAAAGTCACTGGGTGACATGGACTT 
10 01 GGCTCCCCTCC AGAAAAAGCTTC AGGAGTTGTTGAATGGAAAAAGATACT 
1051 TTCTTGTTTTGGATGATGTTTGGAATGAAGATC AAGAAAAGTGGGATAAT 
1101 CTTAGAGCAGTATTGAAGATTGGAGCTAGTGGTGCTTCAATTCTAATTAC 
1151 TACTCGTCTTG AAAAAATTGGATCAATTATGGGAACTTTGCAACTATATC 
1201 AGTTATC AAATTTGTCTCAAGAAGATTGTTGGTTGTTGTTCAAGC AACGT 

12 51 G C ATTT TGC C AC C AAAC C G AAAC AAGT C CT AAAC TT ATGG AAATCGG AAA 

13 01 GGAGATTGTGAAGAAATGTGGGGGTGTGCCTCTAGCAGCCAAAACTCTTG 
13 51 G AGGCC TTTTACGC TTC AAG AGGG AAG AAAGTG AATGGGAAC ATGTG AG A 
1401 GATAGTGAGATTTGGAATTTACCTCAAGATGAAAATTCTGTTTTGCCTGC 
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1451 CCTGAGGCTGAGTTATCATCATCTTCC ACTTG ATTTGAGACAATGTTTTG 

1501 C ATATTGCGC AG TATTCCC AAAGG AC AC C AAAAT AG AAAAGGAATATCTC 

1551 ATCGCTCTCTGGATGGCACACAGTTTTCTTTTATCAAAAGGAAACATGGA 

1601 GCTAGAGGATGTGGGC AATGAAGTATGGAATGAATTATACTTGAGGTCTT 

1651 TTTTCCAAGAGATTGAAGTTAAATCTGGTAAAACTTATTTCAAGATGCAT 

1701 GATCTCATCC ATGATTTGGCTACATCTATGTTTTCAGCAAGCGCATCAAG 

17 51 CAGAAGTATACGCCAAATAAATGTAAAAGATGATGAAGATATGATGTTCA 

1801 TTGTAACAAATTATAAAGATATGATGTCCATTGGTTTCTCCGAAGTGGTG 

1851 TCTTCTTACTCTCCTTCGCTCTTTAAAAGGTTTGTCTCGTTAAGGGTGCT 

1901 TAATCTAAGTAACTCAGAATTTGAACAGTTACCGTCTTCCGTTGGAGATC 

1951 T AGTAC ATTTAAGAT ACCTTGACCTGTCTGGTAATAAAATTTGTAGTCTT 

20 01 CC AAAG AGGTTGTGC AAGCTTC AAAATCTGCAGACTCTTGATCTAT ATAA 

2051 TTGCCAGTCACTTTCTTGTTTGCCGAAACAAACAAGTAAGCTTTGTAGTC 

2101 TCCGGAATCTTGTACTTGATC ACTGTCCATTGACTTCTATGCCACC AAGA 

2151 AT AGG ATTGTTG A C ATGCCTT AAG AC ACT AGGTTAC T TTGT TGTAGG C G A 

22 01 GAGGAAAGGTTATCAACTTGGTGAACTACGAAATTTAAACCTCCGTGGTG 
2251 . C AATTTC AATCACACATCTTG AG AGAGTGAAAAATG ATATGG AGGC AAAA 

23 01 G AAGCCAATTTATCTGCAAAAGCAAATCTACACTCTTTAAGCATGAGTTG 
23 51 GGATAGACCAAACAGATATG AATCCG AAGAAGTTAAAGTGCTTG AAGCCC 
2401 TC AAACC ACATCCCAATCTGAAATATTTAGAAATCATTG ACTTCTGTGGA 
2451 TTCTGTCTCCCTGACTGGATG AATCACTCAGTTTTGAAAAATGTTGTCTC 
2501 TATTCTAATTAGCGGTTGTGAAAACTGCTCGTGCTTACCACCCTTTGGTG 
2 551 AGCTGCCTTGTCTAGAAAGTCTGGAGTTACAAGACGGGTCTGTGGAGGTG 
2601 G AGTATGTTG AAGATTCTGGATTCCTGACAAGAAG AAGATTTCCATCCCT 
2651 GAGAAAACTTCATATAGGTGGCTTTTGTAATCTGAAAGGATTGC AGAGAA 
2701 TG AAAGG AGC AG AGCAATTCCCCGTGCTTGAAGAGATGAAGATTTCGGAT 
2751 TGCCCTATGTTTGTTTTTCCGACCCTTTCTTCTGTCAAGAAATTAGAAAT 
2801 TTGGGGGGAGGCAGATGCAGGAGGTTTGAGCTCCATATCTAATCTCAGCA 
2 851 CTCTTACATCCCTCAAGATTTTCAGTAACCACACAGTGACTTCACTACTG 
2901 GAAGAGATGTTCAAAAACCTTGAAAATCTCATATACTTGAGTGTCTCTTT 

2 951 CTTGGAGAATCTCAAAGAGCTGCCTACCAGCCTGGCTAGTCTCAAC AATT 

3 001 TG AAGTGTCTGGATATTCGTTATTGTTACGCACTAGAGAGTCTCCCCGAG 
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3 0 51 G AAGGGCTGGAAGGTTTATCTTCACTCACAGAGTTATTTGTTGAACACTG 

3101 TAACATGCTAAAATGTTTACCAGAGGGATTGCAGCACCTAACAACCCTCA 

3151 CAAGTTTAAAAATTCGGGGATGTCCAC AACTGATCAAGCGGTGTGAG AAG 

32 01 GG AATAGGAGAAGACTGGCACAAAATTTCTCACATTCCTAATGTGAATAT 

3 251 ATATATTTAA 
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Figure 6E 

1 ATGGCTGAAGCTTTCATTCAAGTTGTGCTAGACAATCTCACTTCTTTCCT 
5 1 CAAAGGGGAACTTGTATTGCTTTTCGGTTTTCAAGATGAGTTCCAAAGGC 
101 TTTCAAGCATGTTTTCTACAATCCAAGCCGTCCTTGAAGATGCTCAAGAG 
151 AAGCAACTCAACGAC AAGCCTCTAGAAAATTGGTTGCAAAAACTC AATGC 

2 01 TGCTACATATGAAGTCGATGACATCTTGGATGAATATAAAACTAAGGCCA 

2 51 C AAGATTCTTGCAGTCTGAATATGGCCGTTATCATCCAAAGGTTATCCCT 

3 01 TTCCGTCAC AAGGTTGGGAAAAGGATGGACCAAGTGATGAAAAAACTGAA 

3 51 TGCAATTGCTGAGGAACGAAAGAATTTTCATTTGCAAGAAAAGATTATAG 
401 AGAG ACAAGCTGCTACACGGGAAACAGGTACTCATCTTAAATTAGTATTA 

4 51 CAACTTAQTTTATATTCATTTGTTTTGGGCAATGATCAAATTATGTAAAG 
501 GTCAAATATACTCATGTACTACTGAAAATAGTTTAAATATACCTCTAGTT 

5 51 ATACTATTAGTACGAACATACTCCTCCCATATACTTTGGAACAAATATTC 

6 01 CCTTAACGAAATAAGACACGTGAAAAGTTCAGATTCAAATTATCCACCCT 

6 51 CAATTTTAAGATCTGATTTCTTT AGGAAACCACTCATC TCCTCCGTTTTG 

7 01 AGTTCTTAACGAAGCAGCTCAGAGAAAAGAGGTTTTCTTCTGTTCT^TOT 
7 51 CTGCTGCATTTGTGTCTTAATCCAATAACAAACAATACAAATTAATATTA 
801 TGTT G AC GATGAGGGTAGTCTTTCT AGCT AGACATGAAC TGAGTGT AAAT 
851 TTTGTTTTAAGGAAGAAAAAGAAATOATTAGGCTGGATTTCTTTCAGAGT 
901 GGAATATAGGGGGATAAAGTTGGAGC AT AGAGTTCC ATCGTTTATTTCTT 
951 TCCTTAAAGTAACAAGTTCAACAAAATGATATCAAGGTACGGTAATGGAA 
1001 AATTATTAGACACGTCTAAACTACAAAAATGGAATAGAAACTTAAATTAT 
1051 CAGTGACAATATCATCCTTTAATAAAGCTACCAAATTTAAATCATGATAC 
1101 AGAGAAGAAAC C AAAAAAAT TAGGGGTGAATTAT TTGATTCT AT GCTT AT 
1151 C ACATGT C TTC CC ATC AACATCAAAGGAAAAATTGTGC CAAAGT AT AAAC 
1201 GGTGCGGTATATTTGGATTGAAAGTAAAACAGGAGGATACATTTGGACTA 
1251 AAAGTATAAC AAT AAGTATATTTGATCATTTTATGTATCAAATTCATOTG 
1301 GTTTTTGGGGAGAAGGGAAGTTTCAATGTTTTCAATCTGCTCCTCATCTC 
1351 ATCCATATCTCTTTATTGTGCAAAACCCTTCTCTATTTAACTATTTTCTG 
14 01 CCGACTCCTAATGAGCTTGAATGTAACAATATTCTCATCTGGACATTGCT 
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14 51 TGCACCAGGTTCTGTGTTAACTGAACCACAAGTTTATGGAAGGGACAAAG 

1501 AAAAAG ATG AGATAGTGAAAATCCTAATAAAC AATGTTAGTGATGCCCAA 

1551 AAACTCTCAGTCCTCCCAATACTTGGTATGGGGGGACTAGGAAAGAC AAC 

1601 TCTTTCCCAAATGGTCTTCAATGATCAGAGAGTAACTGAGCGTTTCTATC 

1651 C C AAAAT ATGG ATTTGCGTCTCGG ATG ATTTT GATG AG AAG AG GTTG ATA 

17 01 AAGG C AAT AGT AG AATC T ATTG AAGGG AAGTCC CTC AGTG AC ATGG ACTT 

17 51 GGCTCCACTTCAAAAGAAGCTTCAAGAGTTGCTGAATGGAAAAAGATACT 

1801 TCCTTGTCTTAGATG ATGTTTGGAATGAAGATCAACATAAGTGGGCTAAT 

1851 TTAAGAGC AGTCTTGAAGGTTGGAGCAAGTGGTGCATTTGTTCTAACTAC 

1901 TACTCGTC TTG AAAAGGTTGG ATCAATTATGGGAACATTGC AACC AT ATG 

1951 AATTGTC AAATCTGTCTCCAGAGGATTGTTGGTTTTTGTTCATGCAGCGT 

2 001 GC AT TTG G AC AC C AAG AAG AAATAAATCC AAACC TTGTGGC AATCGG AAA 

2051 GGAGATTGTGAAAAAATGTGGTGGTGTGCCTCTAGCAGCCAAGACTCTTG 

2101 G AGGTATTTTGCGCTTCAAGAGAGAAGAAAGAGAATGGGAACATGTGAG A 

2151 GAC AGTCC G ATTTGG AATTTGC CTC AAGATG AAAGTTCTATTCTGCC TGC 

2201 CCTGAGGCTTAGTTACCATCATCTTCCACTTGATTTGAGACAATGCTTTG 

2 2 51 TGTATTGTGCGGTATTCCCAAAGGACACCAAAATGGCAAAGGAAAATCTT 
2 301' ATCGCTTTTTGGATGGCACATGGTTTTCTTTTATCGAAAGGAAATTTGGA 
2 3 51 GCTAGAGGATGTAGGTAATGAAGTATGGAATGAATTATACTTGAGGTCTT 
2 401 TC TTCC AAGAG ATTG AAGTTGAATCTGGTAAAACTTATTTC AAG ATG CAT 
2451 GACCTCATCCATGATTTGGCTACATCTCTGTTTTCAGCAAACAC ATC AAG 
2 501 C AGCAATATTCGTGAAATAAATGCTAATTATGATGGATATATGATGTCGA 
2 551 TTGGTTTTGCTGAAGTGGTATCTTCTTACTCTCCTTCACTCTTGCAAAAG 
2 601 TTTGTCTC ATTAAGGGTGCTTAATCTAAGAAACTCGAACCTAAATCAATT 
2 651 ACC ATCTTCCATTGGAGATCTAGTACATTTAAGATACCTGGACTTGTCTG 
2701 GCAATTTTAGAATTCGTAATCTTCCAAAGAGATTATGCAGGCTTCAAAAT 
2751 CTGCAGACTCTTGATCTACATTATTGCGACTCTCTTTCTTGTTTGCC AAA 
2 801 AC AAAC AAGTAAACTTGGTAGTCTCCGAAATCTTTT ACTTGATGGCTGTT 
2 851 C ATTG ACGTCAACGCC ACC AAGG ATAGGATTGTTGACATGCCTT AAG TCT 
2 901 CTAAGTTGCTTTGTTATTGGCAAGAGAAAAGGTTATCAACTTGGTGAACT 

2 951 AAAAAACCTAAATCTCTATGGCTCAATTTCAATCACAAAACTTGACAGAG 

3 0 01 TGAAGAAAGATAGCGATGCAAAAGAAGCTAATTTATCTGCTAAAGCAAAT 
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3 051 CTGC ACTCTTTATGCCTGAGTTGGGACCTTGATGGAAAACATAGATATGA 

3101 TTCAGAAGTTCTTGAAGCCCTCAAACCACACTCCAATCTGAAATATTTAG 

3151 AAATCAATGGCTTCGGAGGAATCCGTCTCCCAGATTGGATGAATCAATCA 

3 2 01 GTTTTGAAAAATGTTGTCTCTATTAGAATTAGAGGTTGTGAAAACTGCTC 

3 2 51 ATGCTTACCACCCTTTGGTGAGCTGCCTTGTCTAG AAAGTCTAGAGTTAC 

3301 ACACCGGGTCAGC AGATGTGGAGTATGTTGAAGATAATGTTCATCCTGGA 

3 351 AGGTTTCCATCCTTGAGGAAACTTGTTATATGGGACTTTAGTAATCTAAA 

3 401 AGGATTGCTGAAAAAGGAAGGAGAAAAGC AATTCCCTGTGCTTGAAGAGA 

3 451 TGACATTTTACTGGTGCCCTATGTTTGTTATTCCGACCCTTTCTTCTGTC 

3 501 AAGACATTGAAAGTTATTGCGACAGATGCAACAGTTTTGAGGTCCATATC 

3 551 TAATCTTAGGGCTCTTACTTCCCTTGACATTAGCAATAACGTAGAAGCTA 

3 601 CTTCACTCCCAGAAGAGATGTTCAAAAGCCTTGCAAATCTCAAATACTTG 

3 651 AATATCTCTTTCTTTAGGAATCTCAAAGAGTTGCCTACCAGCCTGGCTAG 

3 7 01 TCTCAATGCTTTGAAG AGTCTCAAATTTGAATTTTGTAACGCACTAGAG A 

3 7 51 GTC TC C C AG AGG AAGGGGTG AAAGGTTTAAC TTC AC TC AC CG AGTTG TC T 

3 801 GTCAGTAACTGTATGATGCTAAAATGTTTACCGGAGGGATTGCAGCACCT 

3 851 AAC AGCCCTCAC AACTTTAACAATTACTCAATGTCCAATAGTATTCAAGC 

3 901 GGTGTGAGAGAGGAATAGGAGAAGACTGGC ACAAAATTGCTCAC ATTCCA 

3 9 51 TATTTGACTCTATATGAGTGA 
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Figure 6F 

1 ATGGCGGAAGCTTTTCTTCAAGTTCTGCTAGAAAATCTCACTTCTTTCAT 
5 1 CGGAGATAAACTTGTATTGATTTTCGGTTTCGAAAAGGAATGTGAAAAGC 
101 TGTCG AGTGTGTTTTCCACAATTC AAGCTGTGCTTC AAG ATGCTCAGGAG 
151 AAGCAATTGAAGGAC AAGGCAATTG AGAATTGGTTGC AGAAACTCAATTC 
201 TGCTGCCTATGAAGTTGATGATATATTGGGCGAATGTAAAAATGAGGCAA 

2 51 TAAGATTTGAGCAGTCTCGATTAGGGTTTTATCACCCAGGGATTATCAAT 
301 TTCCGTCACAAAATTGGGAGAAGGATGAAAGAGATAATGGAGAAACTAGA 

3 51 TGCAATATCTGAGGAAAGAAGGAAGTTTCATTTCCTTGAAAAAATTACAG 
401 AG AG AC AAGC TGC C GC TGCT ACGC G TG AAAC AGGTGTGAGT ACTGAGT AA 

4 51 TTGTAGCTTAGTTAATATTCAATTTGTTACCACATCATGTGTTCACCGTG 
501 ATCTCTACAGTAGGATGGCAATGGGGCTGGGCGAGGTTGGAGGTGTGCAG 
551 GTGTGTGGCGOAACCCCAACTTTGAGTCTACATAAGTAGGTACTTAAATT 
601 TGT AT AGAGTTGAAC AAGTAC AAAC GC C TCCTACT TGGTGTC C TTATGCG 
651 TATTATGTCACTTAGGATGC ATGTGTCTACTTGTTCAACTTTATATGAGT 
701 TTAAGTTCTACTTGTGCACACCC AAAGTTGGAGCGCGTAGATGTCAGTTG 
751 AT AC C AAGTT AAAAAGGCAT ATTT AT GAATTATGC CTTT AAATTATGATT 
801 c AATTTTGTATCAGTCTGTCCAAAATATGTTCTAGTGAAAGTGTTAAACT 
851 TAGTCTGGATC T GCT ATTG AAAGTGAATTTTTGTGGCAC TAAACAATGC A 
901 ATGGGTCTGGATTCATTTTTGCATTAACTTTTGTTTAGACGATTTTCTTT 
9 51 AT CGAATT TT AC T GTCTAAAAT GGAAAAAGCAAAGAAATAAGAAGTAT AC 
1001 AGAGGCTGACTTCTTCATAGTATCTATCATATAAAAAAAAGCATTGATTA 
1051 CT AGGAT ATGGGT TC TTTTAAATT AC AAATTTGTGAGTTAAAACAGTTCT 
1101 GT TGGG AAGGATTT AGATAC ACGTGGAT AGT ATCT AGAAGTTT T TTAAAT 
1151 AAAAAATTAGKZAAATTATGCGGGCTGGGGCGGGTTGAAAACAGCAAACTT 

12 01 TGCAAGGCTTGGCGGGTCGAAATCTTTGCAAGTTTGTGTGGGTTTGCCCT 
1251 GC ACCAC CCAATCTGCCATT CCTGTCTAAATGTTTGTTTTGTCTAT AATT 

13 01 CTTGCTGACTCATTCTAATGAGCTCAATTGTAACAAATTCTTTGTGTCCA 
13 51 CACTACTTGGAACAGGTTTTGTGTTAACTGAACC AAAAGTCTACGGAAGG 
1401 GACAAAGAGGAGGATGAGATAGTGAAAATTCTGATAAAC AATGTTAATGT 
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1451 TGCCGAAGAACTTCC AGTCTTCCCTATAATTGGTATGGGGGGACTAGGAA 

1501 AG ACG ACACTTGCCC AAATGATCTTCAACGATGAG AG AGTAACTAAGC AT 

.1551 TTC AATCCCAAAATATGGGTTTGTGTCTCAG ATGATTTTGATGAGAAGAG 

1601 GTTAATTAAGAC AATTATAGGAAATATTG AAAG AAGTTCTCCTCATGTTG 

1651 • AGGACTTGGCTTCATTTCAGAAGAAGCTCCAGGAGTTATTGAATGGAAAA 

17 01 CGATACTTGCTTGTCTTAGATGATGTTTGGAATGATGATCTAGAAAAGTG 

17 51 GGCTAAGTTAAGAGC AGTCTTAACTGTTGGAGC AAG AGGTGCTTCTATTC 

1801 TAGCTACTACTCGTCTTGAAAAGGTTGGATC AATTATGGGAACGTTGCAA 

1851 CCATATCATTTGTC AAATTTGTCTCCACATGATAGTTTACTTTTGTTTAT 

19 01 GCAACGCGC ATTTGGGCAACAAAAAGAAGCAAATCCTAATCTAGTGGCCA 

1951 TTGGAAAGGAGATTGTGAAG AAATGTGGTGGTGTGCCTTTAGCAGCCAAG 

2 001 ACTCTTGGTGGTCTTTTACGCTTCAAGAGAGAAGAGAGTGAATGGGAACA 

2051 TGTGAGAGATAATG AGATTTGGAGTCTGCCTC AAGATG AAAGTTCTATTT 

2101 TGCCTGCTCTAAGACTGAGTTATCATCACCTTCCACTTGATTTGAGACAA 

2151 TG C TTTGCGTATTGTGCAGTATTCCC AAAGG AC ACC AAAATG ATAAAGGA 

2201 AAATCTCATTACTCTCTGGATGGCGCATGGTTTTCTTTTATCAAAGGGAA 

22 51 ACTTGGAGCTAGAGGATGTGGGTAATGAAGTATGGAATGAATTATACTTG 

2 3 01 AGGTCTTTCTTCCAAGAAATTGAAGCTAAATCGGGTAATACTTATTTCAA 

2 3 51 GATACATGATCTAATCCATGATTTGGCTACATCTCTGTTTTCGGCAAGCG 

2401 C ATC ATGCGGC AATATCCGCGAAATAAATGTCAAAGATTATAAGCATACA 

24 51 GTGTCCATTGGTTTCGCTGCAGTGGTGTCTTCTTACTCTCCTTCGCTCTT 
2501 G AAAAAGTTTGTCTCGTTAAGGGTGCTTAATCTAAGTTACTC AAAACTTG 

25 51 AGCAATTACCGTCTTCCATTGGAGATCTATTACATTTAAGATACCTGGAC 
2 6 0 1 CTGTCTTGC AATAACTTCCGTAGTCTTCCAGAGAGGTTGTGC AAGCTTCA 

26 51 AAATCTTC AGACTCTTGATGTACATAATTGCTACTCACTTAATTGTTTGC 

27 01 CAAAAC AAACAAGTAAACTTAGTAGTCTCCGACATCTTGTTGTTGATGGC 

27 51 TG T C C ATTG AC TT C T ACTC C AC C AAGG A T AG G ATTG TTG AC ATG CCTT AA 

28 01 GACTCTAGGTTTCTTTATTGTGGGAAGCAAGAAAGGTTATCAACTTGGTG 

28 51 AACTGAAAAACCTAAATCTCTGCGGCTCAATTTCAATCACACACCTTGAG 

29 01 AGAGTGAAGAACGATACGGATGCAGAAGCCAATTTATCTGCAAAAGCAAA 
2951 TCTGC AATCTTTAAGC ATGAGTTGGGATAACGATGGACCAAACAGATATG 
3001 AATCC AAAG AAG TTAAAGTGCTTG AAG C ACTC AAA CCACACCCCAATCTG 
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3 051 AAATATTTAGAGATCATTGCCTTCGGAGGATTCCGTTTTCCAAGCTGGAT 

3101 AAATCACTCAGTTTTGGAGAAGGTCATCTCTGTTAGAATTAAAAGCTGCA 

3151 AAAACTGCTTGTGCTTACCACCCTTTGGGGAGCTTCCTTGTCTAGAAAAT 

32 01 CTAGAGTTAC AAAACGGATCTGCGGAGGTGGAGTATGTTGAAGAGGATGA 

32 51 TGTCCATTCTAGATTCTCCACAAGAAGAAGCTTTCCATCCCTGAAAAAAC 

3 3 01 TTC GTATATGGTTCTTTCGCAGTTTG AAAGGGCTGATGAAAGAGGAAGGA 

3 3 51 GAAGAGAAATTCCCCATGCTTGAAGAGATGGCGATTTTATATTGCCCTCT 

3401 GTTTGTTTTTCCAACCCTTTCTTCTGTCAAGAAATTAGAAGTTC ACGGCA 

3 4 51 ACACAAACACTAGAGGTTTGAGCTCC ATATCTAATCTTAGCACTCTTACT 

3 501 TCCCTCCGCATTGGTGCTAACTACAG AGCGACTTCACTCCCAGAAGAGAT 

3 5 51 GTTCACAAGTCTTACAAATCTCGAATTCTTGAGTTTCTTTGACTTC AAGA 

3 601 ATCTCAAAGATCTGCCTACCAGCCTGACTAGTCTCAATGCTTTGAAGCGT 

3 651 CTCC AAATTGAAAGTTGTGACTCACTAGAGAGTTTCCCTGAAC AAGGGCT 

3 7 01 AG AAG GT TT AAC TTC ACTC AC AC AGTTGTTTG TTAAAT ACTGT AAG ATGC 

37 51 TAAAATGTTTACCCGAGGGATTGCAGCACCTAACAGCCCTCACAAATTTA 

3 801 GGAGTTTCTGGTTGTCC AGAAGTGGAAAAGCGCTGTGATAAGGAAATAGG 

3 851 AGAAGACTGGC AC AAAATTGCTCACATTCCAAATCTGGATATTCATTAG 
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MAEAFIQVLL,DNLTSFLKGELVLiFGFQDE£QRLSSM£STIQAVLiEDAQEKQLNN 55 

KPLENWLOKLNAATYEVDDILDEyKTKATRFSQSEYGRYHPKVIPFRHKVGKKMD 110 

QVMKKLKAIAEERKNFHLHEKIVERQAVRRETGSVLTEPQVYGRDKEKDEIVKIL 165 

INNVSDAQHLSVLpilgm^gJgJct tlaQMVFNDQRVTEHFHSKIWICVSEDFDEK 220 

RLIKAIVESlEGRPLLGEMDLAPLQKKLQELLNGIcryl lvl dd vwNEDQQKW AN L 275 

RAVLKVGASGAs vl t tt rLEKVGSIMGTLQPYELSNLSQEtXrWLLFMQRAFGHQE 33 0 

EINPNLVAIGKEIVKKSGGVPLAAKTLGGILCFKREERAWEHVRDSPIWNLPQDE 3 85 

SSILPALRLSYHQLPLDLKQCFAYCAVFPKDAKMEKEKLISLWMAHGFLLSKGNM 440 

ELEDVGDEVWKELYLRSFFQEIEVKDGKTYFlOnhdlihdJatSLFSANTSSSNIR 49 5 

EINKHS 50 1 

YTHMMSXGFAEWFPYTIiPPLEK 52 4 

FISI-RVt*NIiGDST.FNKIiI»SSIGD 547 

LVHLRYLNLYGSG . MRSLPKQLCK 570 

I*QN1,QT3LDI^YCTKI*CCLPKETSK 594 

LGSLRNLLLDGSQSX.TCMP PRIGS 618 

IiTCI*KTLGQFWGRKKGYQ 6 3 7 

U3ELGNLNLYGSIKISHLERVKNDKDAKEANLSA 671 

KGNLHSI.SMS V3NNFGPHI YESEEVKVLEALKP 7 03 

HSNLTSt-KIYGFRGIH . IjPEWMNHSV 7 2 8 

LKNIVSILISNFRNCSCLPPFGD 7 51 

LPCt*ESLELHWGSAD 766 

VEYVEEVDIDVHSGTPTRIR 786 

FPSIiRKLDIWDFGSLKGLLKKEGEEQ 812 

FPVLEEMIIHECPFLTLSSN 83 2 

LRAiTSLRIC YNKVATSFPEEMFKN 857 

I/ANIiKYIfTI SRCNNLKEI-PTSIiAS 881 

LNALKSLKIQLCC ALESLPEEGLEG 906 

I#S SLTELFVEHCNMIjKCIiP EGLQH 93 0 

LTTLTSLKI RGC pqjj IKRC EKG IG EDWHK 9 59 

ISHIPNVNIYI 970 
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